ubuntu-2010-onlogic

Intel Xeon E-2278GEL testing with a Logic Supply RXM-181 (Z01-0001A027 BIOS) and Intel UHD P630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102011-HA-UBUNTU20190&sor&grs.

ubuntu-2010-onlogicProcessorMotherboardChipsetMemoryDiskGraphicsAudioMonitorNetworkOSKernelDesktopDisplay ServerDisplay DriverOpenGLVulkanCompilerFile-SystemScreen Resolution11a23Intel Xeon E-2278GEL @ 3.90GHz (8 Cores / 16 Threads)Logic Supply RXM-181 (Z01-0001A027 BIOS)Intel Cannon Lake PCH16GB512GB TS512GMTE510TIntel UHD P630 3GB (1150MHz)Realtek ALC233DELL P2415QIntel I219-LM + 2 x Intel I210Ubuntu 20.105.8.0-41-generic (x86_64)GNOME Shell 3.38.2X Server 1.20.9intel4.6 Mesa 20.2.61.2.145GCC 10.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: intel_pstate powersave - CPU Microcode: 0xde - Thermald 2.3Python Details- Python 3.8.6Security Details- itlb_multihit: KVM: Mitigation of VMX disabled + l1tf: Not affected + mds: Not affected + meltdown: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling + srbds: Mitigation of TSX disabled + tsx_async_abort: Mitigation of TSX disabled

ubuntu-2010-onlogicredis: LPOPonednn: IP Shapes 1D - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Deconvolution Batch shapes_3d - u8s8f32 - CPUonednn: IP Shapes 3D - u8s8f32 - CPUredis: GETonednn: Matrix Multiply Batch Shapes Transformer - f32 - CPUnpb: EP.Cncnn: Vulkan GPU-v3-v3 - mobilenet-v3ncnn: Vulkan GPU - regnety_400monednn: Convolution Batch Shapes Auto - f32 - CPUdav1d: Chimera 1080ponednn: Recurrent Neural Network Inference - f32 - CPUclomp: Static OMP Speedupnumpy: onednn: Recurrent Neural Network Inference - u8s8f32 - CPUncnn: Vulkan GPU-v2-v2 - mobilenet-v2ncnn: CPU - blazefacencnn: CPU - shufflenet-v2ncnn: Vulkan GPU - shufflenet-v2espeak: Text-To-Speech Synthesisonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUnpb: EP.Donednn: Convolution Batch Shapes Auto - u8s8f32 - CPUbuild-eigen: Time To Compilebuild-ffmpeg: Time To Compileonednn: Recurrent Neural Network Training - f32 - CPUcython-bench: N-Queensonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUstockfish: Total Timeasmfish: 1024 Hash Memory, 26 Depthonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Training - u8s8f32 - CPUvkfft: ncnn: CPU - efficientnet-b0crafty: Elapsed Timesimdjson: DistinctUserIDncnn: CPU-v3-v3 - mobilenet-v3vkmark: 1920 x 1080onednn: IP Shapes 3D - f32 - CPUqe: AUSURF112dav1d: Summer Nature 1080ponnx: super-resolution-10 - OpenMP CPUbrl-cad: VGR Performance Metriconednn: Deconvolution Batch shapes_1d - u8s8f32 - CPUcloverleaf: Lagrangian-Eulerian Hydrodynamicsqmcpack: simple-H2Osqlite-speedtest: Timed Time - Size 1,000lzbench: Zstd 8 - Compressionncnn: CPU - mnasnetcompress-lz4: 1 - Compression Speedlzbench: Brotli 2 - Compressionaskap: tConvolve OpenMP - Griddingunpack-firefox: firefox-84.0.source.tar.xzrav1e: 10ncnn: Vulkan GPU - mnasnettnn: CPU - MobileNet v2lzbench: Crush 0 - Compressionmnn: SqueezeNetV1.0embree: Pathtracer - Crownrav1e: 5webp2: Defaultaskap: tConvolve MPI - Griddingbasis: UASTC Level 2npb: CG.Cdav1d: Summer Nature 4Kbuild2: Time To Compilencnn: Vulkan GPU - blazefacecompress-lz4: 3 - Compression Speedrav1e: 1embree: Pathtracer ISPC - Crowncompress-lz4: 9 - Compression Speedlzbench: Brotli 0 - Compressionetcpak: ETC2vkresample: 2x - Singleembree: Pathtracer ISPC - Asian Dragonaskap: Hogbom Clean OpenMPcompress-lz4: 1 - Decompression Speedgcrypt: mnn: resnet-v2-50cryptsetup: AES-XTS 512b Encryptionredis: SADDbuild-godot: Time To Compilencnn: Vulkan GPU - efficientnet-b0cryptsetup: AES-XTS 256b Decryptionmnn: MobileNetV2_224mnn: inception-v3mafft: Multiple Sequence Alignment - LSU RNAncnn: Vulkan GPU - resnet18cryptsetup: AES-XTS 512b Decryptionlzbench: Zstd 8 - Decompressionembree: Pathtracer - Asian Dragonaskap: tConvolve MPI - Degriddingetcpak: DXT1lzbench: Libdeflate 1 - Compressionncnn: Vulkan GPU - googlenetlzbench: Zstd 1 - Compressionwebp2: Quality 75, Compression Effort 7mnn: mobilenet-v1-1.0embree: Pathtracer ISPC - Asian Dragon Objetcpak: ETC1 + Ditheringnode-web-tooling: gromacs: Water Benchmarkncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - resnet18coremark: CoreMark Size 666 - Iterations Per Secondvkresample: 2x - Doublelzbench: Brotli 0 - Decompressioncompress-zstd: 3encode-ape: WAV To APEnpb: MG.Clammps: Rhodopsin Proteincryptsetup: AES-XTS 256b Encryptionquantlib: cryptsetup: Serpent-XTS 512b Encryptionncnn: CPU - alexnetembree: Pathtracer - Asian Dragon Objncnn: Vulkan GPU - vgg16kripke: cryptsetup: Serpent-XTS 256b Decryptionastcenc: Thoroughfinancebench: Bonds OpenMPncnn: Vulkan GPU - resnet50basis: ETC1Srav1e: 6etcpak: ETC1lzbench: Zstd 1 - Decompressionncnn: Vulkan GPU - mobilenetcryptsetup: Serpent-XTS 512b Decryptionncnn: CPU - googlenetcryptsetup: Serpent-XTS 256b Encryptioncompress-lz4: 9 - Decompression Speedencode-opus: WAV To Opus Encodelzbench: Brotli 2 - Decompressioncryptsetup: Twofish-XTS 256b Decryptionncnn: CPU - yolov4-tinyaskap: tConvolve OpenMP - Degriddingnpb: LU.Castcenc: Fastwebp2: Quality 95, Compression Effort 7phpbench: PHP Benchmark Suitecp2k: Fayalite-FIST Dataai-benchmark: Device Inference Scorebasis: UASTC Level 2 + RDO Post-Processingncnn: CPU - vgg16indigobench: CPU - Supercartnn: CPU - SqueezeNet v1.1onnx: shufflenet-v2-10 - OpenMP CPUdav1d: Chimera 1080p 10-bitcryptsetup: Twofish-XTS 512b Encryptionwebp2: Quality 100, Compression Effort 5ncnn: Vulkan GPU - yolov4-tinyaskap: tConvolve MT - Griddingopenfoam: Motorbike 30Mindigobench: CPU - Bedroomnpb: FT.Ccryptsetup: Twofish-XTS 256b Encryptioncryptsetup: Twofish-XTS 512b Decryptionai-benchmark: Device AI Scorencnn: CPU - squeezenet_ssdastcenc: Exhaustivebasis: UASTC Level 3askap: tConvolve MT - Degriddingwebp2: Quality 100, Lossless Compressionhmmer: Pfam Database Searchcompress-lz4: 3 - Decompression Speedcryptsetup: PBKDF2-sha512lulesh: ncnn: Vulkan GPU - alexnetamg: financebench: Repo OpenMPcryptsetup: PBKDF2-whirlpoolncnn: CPU - mobilenetncnn: Vulkan GPU - squeezenet_ssdredis: LPUSHbasis: UASTC Level 0redis: SETsynthmark: VoiceMark_100encode-wavpack: WAV To WavPackncnn: CPU - resnet50ai-benchmark: Device Training Scoreonnx: fcn-resnet101-11 - OpenMP CPUonnx: bertsquad-10 - OpenMP CPUonnx: yolov4 - OpenMP CPUastcenc: Mediumcompress-zstd: 19simdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyalzbench: Crush 0 - Decompressionlzbench: XZ 0 - Decompressionlzbench: XZ 0 - Compressionwarsow: 1920 x 1080ddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - Multeasymapddnet: 1920 x 1080 - Fullscreen - OpenGL 3.3 - Default - RaiNyMore2ncnn: CPU - regnety_400monednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: IP Shapes 1D - u8s8f32 - CPU11a231273501.90862.9209.3192.072827879.758.5278510.26875.319394.410112703703.337.595451017.825.7817.9033.3155436.173508.552.9329.043472.527.762.446.646.6730.8153469.63870.7031.781789.90786.5876006.3425.8214.6489111763649189475986000.836004.0612969.9873558770.615.7752518.35822191.15427.623243685617.99169300.0538.53462.031815.775218.11168690.01721.4073.1585.76372.0391017.9776.95711.0965.5161155.9748.5932905.81101.12231.1622.4544.530.3867.687443.44416165.242498.5328.9844104.6406003.6229.82042.4712784.82261333.33228.92710.003394.04.61648.21212.58524.282783.617437.99111284.261208.89221322.36452392.3124.0967.9852280.77110.390.5197.8124.25243609.1158611004.3895621611.212.7705680.845.0023387.62180.5737.621.117.3402114.0317521143723.946.7871041.96875043.8159.9511.445299.463160727.56723.822.29736.35990.09.606650404.040.921161.0015451.707.13724.5176551651478.055743808.351113.972.509343.5741415688.15400.921.34640.95613.317464.541.0897365.58400.8403.7143028.66387.7195.508993.0301445.403131.2655983.515887512738.352521.1212237216749677.13802166733127.5528.691718046.719.2281980808.83615.63816.69343.67687453972616.2312.50.60.370.484501053817.888.291132.905091729976.596.852068.130494.617623.990592565612.757.23123983.136.0418.6132.1370446.953402.403.0339.743402.038.002.516.836.8631.6413382.74860.2731.034487.79188.4275899.1226.3274.5599611662466186578045902.685896.89128510.1574624640.615.8651818.37662164.20433.953197694887.88645303.9738.04961.254825.845224.18170682.72921.6353.1875.82368.3041027.9016.92721.1025.4911146.1148.1892881.77100.75230.3362.4744.840.3857.746643.74419164.078500.9118.9358103.9506043.1229.30842.2072767.82247597.67228.75510.063373.84.58947.93412.60624.152769.217487.97861278.111203.14921422.26451392.4074.0797.9535279.65810.350.5217.8424.16244321.4829321005.9325641605.612.8145669.094.9943377.02185.0735.521.057.3418113.7217477813722.246.8971207.26562543.7160.0801.442298.894160727.51722.522.25735.05983.29.602650403.440.981159.3215459.457.12723.5226542721478.482742807.286113.822.512343.2091414188.24400.521.36740.99613.906464.941.0907371.11400.5403.4142928.68387.9795.569992.4131446.293131.3095980.315879502738.006621.1112231703349656.07291766760627.5428.701717450.179.2311981355.17615.80216.68943.68687453972616.2312.50.60.370.484501053819.347.669422.625106.645368.714964.669444.102287.208521032.0532.1167451.883387.303.0328.973363.6730.9613382.38881.3431.026289.59187.3645884.7625.8334.5705011884731186016045893.195898.04128774809330.6251718.09862158.80433.447.93690303.6738.453825162.571693.1541016.89251.0925.4662901.19101.58232.22244.880.3837.740843.76417165.255500.6858.99526034.7230.795227.55512.53617398.01941208.664214450394.0137.9747280.583244509.6366681007.9945621606.612.7755661.405.0112187.17.36111.443299.21516105980.19.61765115474.021480.06688.20464.977369.78131.2355981.82739.312812233446712.50.60.370.48450105387.814822.65725OpenBenchmarking.org

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time1510152025Min: 3.69 / Avg: 4.77 / Max: 18.371. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

OpenBenchmarking.orgMilliseconds, Fewer Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time1510152025Min: 8.52 / Avg: 10.85 / Max: 18.851. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Redis

Test: LPOP

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPOP1a2600K1200K1800K2400K3000KSE +/- 6710.43, N = 3SE +/- 6466.66, N = 32827879.751729976.591. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU321a246810SE +/- 0.04844, N = 15SE +/- 0.07900, N = 15SE +/- 0.11268, N = 36.645366.852068.52785MIN: 5.91MIN: 6.11MIN: 8.221. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU231a3691215SE +/- 0.02351, N = 3SE +/- 0.13403, N = 3SE +/- 0.04090, N = 38.130498.7149610.26870MIN: 8.05MIN: 8.53MIN: 10.121. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU231a1.19692.39383.59074.78765.9845SE +/- 0.01117, N = 3SE +/- 0.02611, N = 3SE +/- 0.03383, N = 34.617624.669445.31939MIN: 4.49MIN: 4.56MIN: 5.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU231a0.99231.98462.97693.96924.9615SE +/- 0.00961, N = 3SE +/- 0.02669, N = 3SE +/- 0.03942, N = 33.990594.102284.41011MIN: 3.92MIN: 3.98MIN: 4.141. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: GET1a2600K1200K1800K2400K3000KSE +/- 17672.66, N = 3SE +/- 10642.86, N = 32703703.332565612.751. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU321a246810SE +/- 0.00510, N = 3SE +/- 0.02083, N = 3SE +/- 0.02011, N = 37.208527.231237.59545MIN: 7.12MIN: 7.1MIN: 7.471. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

Test / Class: EP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.C31a22004006008001000SE +/- 1.30, N = 3SE +/- 16.69, N = 3SE +/- 1.43, N = 31032.051017.82983.131. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v3-v3 - Model: mobilenet-v31a2246810SE +/- 0.03, N = 3SE +/- 0.19, N = 35.786.04MIN: 5.59 / MAX: 6.95MIN: 5.6 / MAX: 7.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: regnety_400m1a2510152025SE +/- 0.22, N = 3SE +/- 0.08, N = 317.9018.61MIN: 17.15 / MAX: 18.95MIN: 17.25 / MAX: 201. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU321a816243240SE +/- 0.08, N = 3SE +/- 0.09, N = 3SE +/- 0.39, N = 332.1232.1433.32MIN: 31.91MIN: 31.94MIN: 32.41. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Chimera 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p321a100200300400500SE +/- 1.05, N = 3SE +/- 1.54, N = 3SE +/- 0.72, N = 3451.88446.95436.17MIN: 337.72 / MAX: 664.93MIN: 335.55 / MAX: 660.93MIN: 338.67 / MAX: 653.631. (CC) gcc options: -pthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU321a8001600240032004000SE +/- 6.49, N = 3SE +/- 1.31, N = 3SE +/- 12.64, N = 33387.303402.403508.55MIN: 3338.56MIN: 3364.88MIN: 3442.981. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

CLOMP

Static OMP Speedup

OpenBenchmarking.orgSpeedup, More Is BetterCLOMP 1.2Static OMP Speedup321a0.6751.352.0252.73.375SE +/- 0.03, N = 3SE +/- 0.03, N = 9SE +/- 0.03, N = 33.03.02.91. (CC) gcc options: -fopenmp -O3 -lm

Numpy Benchmark

OpenBenchmarking.orgScore, More Is BetterNumpy Benchmark21a370140210280350SE +/- 0.16, N = 3SE +/- 0.25, N = 3SE +/- 0.58, N = 3339.74329.04328.97

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU321a7001400210028003500SE +/- 1.67, N = 3SE +/- 7.92, N = 3SE +/- 42.16, N = 33363.673402.033472.52MIN: 3322.79MIN: 3340.17MIN: 3371.231. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU-v2-v2 - Model: mobilenet-v21a2246810SE +/- 0.03, N = 3SE +/- 0.14, N = 37.768.00MIN: 7.45 / MAX: 9.82MIN: 7.65 / MAX: 9.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: blazeface1a20.56481.12961.69442.25922.824SE +/- 0.06, N = 3SE +/- 0.01, N = 32.442.51MIN: 2.28 / MAX: 2.98MIN: 2.35 / MAX: 3.031. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: shufflenet-v21a2246810SE +/- 0.09, N = 3SE +/- 0.11, N = 36.646.83MIN: 6.5 / MAX: 8.16MIN: 6.65 / MAX: 7.981. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: shufflenet-v21a2246810SE +/- 0.08, N = 3SE +/- 0.18, N = 36.676.86MIN: 6.51 / MAX: 8.11MIN: 6.5 / MAX: 8.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

OpenBenchmarking.orgSeconds, Fewer Is BettereSpeak-NG Speech Engine 20200907Text-To-Speech Synthesis1a32714212835SE +/- 0.44, N = 4SE +/- 0.19, N = 4SE +/- 0.22, N = 2030.8230.9631.641. (CC) gcc options: -O2 -std=c99

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU321a7001400210028003500SE +/- 14.91, N = 3SE +/- 13.00, N = 3SE +/- 27.44, N = 33382.383382.743469.63MIN: 3321.67MIN: 3319.45MIN: 3389.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

Test / Class: EP.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: EP.D31a22004006008001000SE +/- 13.60, N = 3SE +/- 10.93, N = 12SE +/- 8.64, N = 12881.34870.70860.271. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU321a714212835SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.18, N = 331.0331.0331.78MIN: 30.84MIN: 30.81MIN: 31.281. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Eigen Compilation 3.3.9Time To Compile231a20406080100SE +/- 0.03, N = 3SE +/- 0.11, N = 3SE +/- 0.01, N = 387.7989.5989.91

Timed FFmpeg Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed FFmpeg Compilation 4.2.2Time To Compile1a3220406080100SE +/- 0.42, N = 3SE +/- 0.24, N = 3SE +/- 1.19, N = 386.5987.3688.43

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU321a13002600390052006500SE +/- 6.65, N = 3SE +/- 3.24, N = 3SE +/- 16.93, N = 35884.765899.126006.34MIN: 5829.64MIN: 5840.53MIN: 5936.31. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cython Benchmark

Test: N-Queens

OpenBenchmarking.orgSeconds, Fewer Is BetterCython Benchmark 0.29.21Test: N-Queens1a32612182430SE +/- 0.03, N = 3SE +/- 0.09, N = 3SE +/- 0.25, N = 1525.8225.8326.33

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU231a1.0462.0923.1384.1845.23SE +/- 0.04208, N = 10SE +/- 0.04478, N = 9SE +/- 0.04669, N = 84.559964.570504.64891MIN: 3.76MIN: 3.81MIN: 3.941. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 12Total Time31a23M6M9M12M15MSE +/- 160953.33, N = 4SE +/- 37547.46, N = 3SE +/- 47603.72, N = 31188473111763649116624661. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

OpenBenchmarking.orgNodes/second, More Is BetterasmFish 2018-07-231024 Hash Memory, 26 Depth1a234M8M12M16M20MSE +/- 239170.10, N = 3SE +/- 47854.09, N = 3SE +/- 209621.37, N = 3189475981865780418601604

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU321a13002600390052006500SE +/- 4.88, N = 3SE +/- 13.63, N = 3SE +/- 19.70, N = 35893.195902.686000.83MIN: 5838.51MIN: 5841.04MIN: 5915.811. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU231a13002600390052006500SE +/- 6.73, N = 3SE +/- 12.51, N = 3SE +/- 18.98, N = 35896.895898.046004.06MIN: 5837.81MIN: 5828.75MIN: 5933.151. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VkFFT

OpenBenchmarking.orgBenchmark Score, More Is BetterVkFFT 1.1.11a32130060090012001500SE +/- 0.67, N = 3SE +/- 0.58, N = 3SE +/- 1.76, N = 312961287128512731. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: efficientnet-b01a23691215SE +/- 0.14, N = 3SE +/- 0.02, N = 39.9810.15MIN: 9.65 / MAX: 19.45MIN: 9.86 / MAX: 10.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Crafty

Elapsed Time

OpenBenchmarking.orgNodes Per Second, More Is BetterCrafty 25.2Elapsed Time321a1.6M3.2M4.8M6.4M8MSE +/- 8686.26, N = 3SE +/- 26469.09, N = 3SE +/- 25940.20, N = 37480933746246473558771. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: DistinctUserID321a0.13950.2790.41850.5580.6975SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.620.610.611. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v3-v3 - Model: mobilenet-v31a21.31852.6373.95555.2746.5925SE +/- 0.03, N = 3SE +/- 0.04, N = 35.775.86MIN: 5.57 / MAX: 7.01MIN: 5.62 / MAX: 7.631. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VKMark

Resolution: 1920 x 1080

OpenBenchmarking.orgVKMark Score, More Is BetterVKMark 2020-05-21Resolution: 1920 x 10801a23110220330440550SE +/- 2.40, N = 3SE +/- 0.58, N = 3SE +/- 0.58, N = 35255185171. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU31a2510152025SE +/- 0.05, N = 3SE +/- 0.15, N = 3SE +/- 0.04, N = 318.1018.3618.38MIN: 17.7MIN: 17.58MIN: 17.991. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Quantum ESPRESSO

Input: AUSURF112

OpenBenchmarking.orgSeconds, Fewer Is BetterQuantum ESPRESSO 6.7Input: AUSURF112321a5001000150020002500SE +/- 6.30, N = 3SE +/- 13.36, N = 3SE +/- 20.55, N = 32158.802164.202191.151. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

dav1d

Video Input: Summer Nature 1080p

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 1080p231a90180270360450SE +/- 0.48, N = 3SE +/- 1.00, N = 3SE +/- 0.99, N = 3433.95433.44427.62MIN: 372.19 / MAX: 467.89MIN: 370.03 / MAX: 468.99MIN: 366.19 / MAX: 4621. (CC) gcc options: -pthread

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: super-resolution-10 - Device: OpenMP CPU1a27001400210028003500SE +/- 6.71, N = 3SE +/- 34.43, N = 3324331971. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.30.8VGR Performance Metric21a15K30K45K60K75K69488685611. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU231a246810SE +/- 0.12374, N = 12SE +/- 0.12812, N = 12SE +/- 0.12949, N = 127.886457.936907.99169MIN: 6.33MIN: 6.33MIN: 6.381. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

OpenBenchmarking.orgSeconds, Fewer Is BetterCloverLeafLagrangian-Eulerian Hydrodynamics1a3270140210280350SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 3300.05303.67303.971. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

QMCPACK

Input: simple-H2O

OpenBenchmarking.orgTotal Execution Time - Seconds, Fewer Is BetterQMCPACK 3.10Input: simple-H2O231a918273645SE +/- 0.63, N = 3SE +/- 0.55, N = 4SE +/- 0.55, N = 438.0538.4538.531. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

SQLite Speedtest

Timed Time - Size 1,000

OpenBenchmarking.orgSeconds, Fewer Is BetterSQLite Speedtest 3.30Timed Time - Size 1,00021a1428425670SE +/- 0.20, N = 3SE +/- 0.23, N = 361.2562.031. (CC) gcc options: -O2 -ldl -lz -lpthread

lzbench

Test: Zstd 8 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Compression321a204060801008282811. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mnasnet1a21.3142.6283.9425.2566.57SE +/- 0.10, N = 3SE +/- 0.10, N = 35.775.84MIN: 5.35 / MAX: 7.36MIN: 5.28 / MAX: 7.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LZ4 Compression

Compression Level: 1 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Compression Speed21a311002200330044005500SE +/- 3.55, N = 3SE +/- 1.32, N = 3SE +/- 4.86, N = 35224.185218.115162.571. (CC) gcc options: -O3

lzbench

Test: Brotli 2 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Compression231a40801201602001701691681. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP

Test: tConvolve OpenMP - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Gridding1a2150300450600750SE +/- 9.06, N = 3SE +/- 2.68, N = 3690.02682.731. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

OpenBenchmarking.orgSeconds, Fewer Is BetterUnpacking Firefox 84.0Extracting: firefox-84.0.source.tar.xz1a2510152025SE +/- 0.24, N = 7SE +/- 0.32, N = 421.4121.64

rav1e

Speed: 10

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 1021a30.71711.43422.15132.86843.5855SE +/- 0.027, N = 3SE +/- 0.004, N = 3SE +/- 0.002, N = 33.1873.1583.154

NCNN

Target: Vulkan GPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mnasnet1a21.30952.6193.92855.2386.5475SE +/- 0.11, N = 3SE +/- 0.12, N = 35.765.82MIN: 5.31 / MAX: 7.49MIN: 5.3 / MAX: 7.261. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

Target: CPU - Model: MobileNet v2

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: MobileNet v221a80160240320400SE +/- 0.47, N = 3SE +/- 2.01, N = 3368.30372.04MIN: 366.89 / MAX: 369.96MIN: 368.17 / MAX: 376.41. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

Test: Crush 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Compression231a20406080100SE +/- 1.00, N = 3SE +/- 0.33, N = 31021011011. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: SqueezeNetV1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: SqueezeNetV1.021a246810SE +/- 0.038, N = 3SE +/- 0.010, N = 37.9017.977MIN: 7.36 / MAX: 20.81MIN: 7.4 / MAX: 21.281. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Crown1a23246810SE +/- 0.0934, N = 4SE +/- 0.1026, N = 4SE +/- 0.0890, N = 56.95716.92726.8925MIN: 6.68 / MAX: 8.51MIN: 6.63 / MAX: 8.55MIN: 6.62 / MAX: 8.55

rav1e

Speed: 5

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 521a30.2480.4960.7440.9921.24SE +/- 0.003, N = 3SE +/- 0.004, N = 3SE +/- 0.001, N = 31.1021.0961.092

WebP2 Image Encode

Encode Settings: Default

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Default321a1.24112.48223.72334.96446.2055SE +/- 0.030, N = 3SE +/- 0.020, N = 3SE +/- 0.029, N = 35.4665.4915.5161. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

ASKAP

Test: tConvolve MPI - Gridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Gridding1a22004006008001000SE +/- 5.07, N = 3SE +/- 12.47, N = 31155.971146.111. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Basis Universal

Settings: UASTC Level 2

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 221a1122334455SE +/- 0.77, N = 3SE +/- 0.69, N = 448.1948.591. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C1a326001200180024003000SE +/- 1.91, N = 3SE +/- 2.44, N = 3SE +/- 0.89, N = 32905.812901.192881.771. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

dav1d

Video Input: Summer Nature 4K

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Summer Nature 4K31a220406080100SE +/- 0.36, N = 3SE +/- 0.28, N = 3SE +/- 0.37, N = 3101.58101.12100.75MIN: 87.94 / MAX: 106.99MIN: 91.52 / MAX: 106.08MIN: 83.92 / MAX: 106.221. (CC) gcc options: -pthread

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile21a350100150200250SE +/- 1.08, N = 3SE +/- 0.90, N = 3SE +/- 1.00, N = 3230.34231.16232.22

NCNN

Target: Vulkan GPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: blazeface1a20.55581.11161.66742.22322.779SE +/- 0.06, N = 3SE +/- 0.05, N = 32.452.47MIN: 2.29 / MAX: 2.85MIN: 2.31 / MAX: 2.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LZ4 Compression

Compression Level: 3 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Compression Speed321a1020304050SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 344.8844.8444.531. (CC) gcc options: -O3

rav1e

Speed: 1

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 11a230.08690.17380.26070.34760.4345SE +/- 0.001, N = 3SE +/- 0.001, N = 3SE +/- 0.000, N = 30.3860.3850.383

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Crown231a246810SE +/- 0.0480, N = 3SE +/- 0.0611, N = 3SE +/- 0.1307, N = 37.74667.74087.6874MIN: 7.5 / MAX: 9.39MIN: 7.48 / MAX: 9.45MIN: 7.27 / MAX: 9.52

LZ4 Compression

Compression Level: 9 - Compression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Compression Speed321a1020304050SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 343.7643.7443.441. (CC) gcc options: -O3

lzbench

Test: Brotli 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Compression231a90180270360450SE +/- 0.33, N = 34194174161. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak

Configuration: ETC2

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC231a24080120160200SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.90, N = 3165.26165.24164.081. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

VkResample

Upscale: 2x - Precision: Single

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Single1a321110220330440550SE +/- 0.33, N = 3SE +/- 0.25, N = 3SE +/- 0.18, N = 3SE +/- 0.26, N = 3498.53500.69500.91501.911. (CXX) g++ options: -O3 -pthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon31a23691215SE +/- 0.0858, N = 3SE +/- 0.0549, N = 3SE +/- 0.0413, N = 38.99528.98448.9358MIN: 8.75 / MAX: 9.95MIN: 8.77 / MAX: 9.8MIN: 8.75 / MAX: 9.83

ASKAP

Test: Hogbom Clean OpenMP

OpenBenchmarking.orgIterations Per Second, More Is BetterASKAP 1.0Test: Hogbom Clean OpenMP1a220406080100SE +/- 0.26, N = 3SE +/- 0.06, N = 3104.64103.951. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

LZ4 Compression

Compression Level: 1 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 1 - Decompression Speed231a13002600390052006500SE +/- 0.40, N = 3SE +/- 2.77, N = 3SE +/- 5.95, N = 36043.16034.76003.61. (CC) gcc options: -O3

Gcrypt Library

OpenBenchmarking.orgSeconds, Fewer Is BetterGcrypt Library 1.921a350100150200250SE +/- 0.19, N = 3SE +/- 0.49, N = 3SE +/- 0.52, N = 3229.31229.82230.801. (CC) gcc options: -O2 -fvisibility=hidden

Mobile Neural Network

Model: resnet-v2-50

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: resnet-v2-5021a1020304050SE +/- 0.03, N = 3SE +/- 0.06, N = 342.2142.47MIN: 40.94 / MAX: 44.99MIN: 41.07 / MAX: 45.741. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Cryptsetup

AES-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Encryption1a26001200180024003000SE +/- 4.28, N = 3SE +/- 12.45, N = 32784.82767.8

Redis

Test: SADD

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SADD1a2500K1000K1500K2000K2500KSE +/- 6947.45, N = 3SE +/- 3348.75, N = 32261333.332247597.671. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile321a50100150200250SE +/- 0.14, N = 3SE +/- 0.12, N = 3SE +/- 1.32, N = 3227.56228.76228.93

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: efficientnet-b01a23691215SE +/- 0.13, N = 3SE +/- 0.14, N = 310.0010.06MIN: 9.67 / MAX: 11.41MIN: 9.74 / MAX: 19.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

AES-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Decryption1a27001400210028003500SE +/- 8.42, N = 3SE +/- 18.72, N = 33394.03373.8

Mobile Neural Network

Model: MobileNetV2_224

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: MobileNetV2_22421a1.03862.07723.11584.15445.193SE +/- 0.008, N = 3SE +/- 0.014, N = 34.5894.616MIN: 4 / MAX: 5.39MIN: 4 / MAX: 5.381. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: inception-v321a1122334455SE +/- 1.29, N = 3SE +/- 1.11, N = 347.9348.21MIN: 39.27 / MAX: 62.11MIN: 40.03 / MAX: 87.291. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MAFFT Alignment 7.471Multiple Sequence Alignment - LSU RNA31a23691215SE +/- 0.03, N = 3SE +/- 0.02, N = 3SE +/- 0.10, N = 312.5412.5912.611. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet1821a612182430SE +/- 0.07, N = 3SE +/- 0.02, N = 324.1524.28MIN: 23.68 / MAX: 25.72MIN: 23.77 / MAX: 32.561. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

AES-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 512b Decryption1a26001200180024003000SE +/- 4.37, N = 3SE +/- 10.39, N = 32783.62769.2

lzbench

Test: Zstd 8 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 8 - Process: Decompression21a3400800120016002000SE +/- 11.50, N = 3SE +/- 8.19, N = 31748174317391. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon31a2246810SE +/- 0.0068, N = 3SE +/- 0.0156, N = 3SE +/- 0.0266, N = 38.01947.99117.9786MIN: 7.79 / MAX: 9.01MIN: 7.78 / MAX: 9.04MIN: 7.81 / MAX: 8.99

ASKAP

Test: tConvolve MPI - Degridding

OpenBenchmarking.orgMpix/sec, More Is BetterASKAP 1.0Test: tConvolve MPI - Degridding1a230060090012001500SE +/- 8.33, N = 3SE +/- 11.46, N = 31284.261278.111. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Etcpak

Configuration: DXT1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: DXT11a3230060090012001500SE +/- 1.99, N = 3SE +/- 1.26, N = 3SE +/- 0.69, N = 31208.891208.661203.151. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Libdeflate 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Libdeflate 1 - Process: Compression321a501001502002502142142131. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

Target: Vulkan GPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: googlenet21a510152025SE +/- 0.36, N = 3SE +/- 0.23, N = 322.2622.36MIN: 21.15 / MAX: 23.73MIN: 21.06 / MAX: 23.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

lzbench

Test: Zstd 1 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Compression1a23100200300400500SE +/- 0.58, N = 3SE +/- 0.67, N = 3SE +/- 1.53, N = 34524514501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 75, Compression Effort 71a2390180270360450SE +/- 1.12, N = 3SE +/- 1.91, N = 3SE +/- 1.58, N = 3392.31392.41394.011. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Mobile Neural Network

Model: mobilenet-v1-1.0

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 1.1.1Model: mobilenet-v1-1.021a0.92161.84322.76483.68644.608SE +/- 0.012, N = 3SE +/- 0.008, N = 34.0794.096MIN: 3.87 / MAX: 16.37MIN: 3.86 / MAX: 17.271. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer ISPC - Model: Asian Dragon Obj1a32246810SE +/- 0.0235, N = 3SE +/- 0.0138, N = 3SE +/- 0.0122, N = 37.98527.97477.9535MIN: 7.77 / MAX: 8.68MIN: 7.77 / MAX: 8.72MIN: 7.72 / MAX: 8.73

Etcpak

Configuration: ETC1 + Dithering

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC1 + Dithering1a3260120180240300SE +/- 0.11, N = 3SE +/- 0.04, N = 3SE +/- 0.99, N = 3280.77280.58279.661. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Node.js V8 Web Tooling Benchmark

OpenBenchmarking.orgruns/s, More Is BetterNode.js V8 Web Tooling Benchmark1a23691215SE +/- 0.08, N = 3SE +/- 0.11, N = 310.3910.351. Nodejs v12.18.2

GROMACS

Water Benchmark

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2020.3Water Benchmark21a0.11720.23440.35160.46880.586SE +/- 0.001, N = 3SE +/- 0.002, N = 30.5210.5191. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU-v2-v2 - Model: mobilenet-v21a2246810SE +/- 0.02, N = 3SE +/- 0.02, N = 37.817.84MIN: 7.6 / MAX: 9.81MIN: 7.59 / MAX: 9.751. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet1821a612182430SE +/- 0.09, N = 3SE +/- 0.01, N = 324.1624.25MIN: 23.71 / MAX: 25.08MIN: 23.71 / MAX: 26.651. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

OpenBenchmarking.orgIterations/Sec, More Is BetterCoremark 1.0CoreMark Size 666 - Iterations Per Second321a50K100K150K200K250KSE +/- 2406.51, N = 9SE +/- 2311.32, N = 10SE +/- 1679.06, N = 14244509.64244321.48243609.121. (CC) gcc options: -O2 -lrt" -lrt

VkResample

Upscale: 2x - Precision: Double

OpenBenchmarking.orgms, Fewer Is BetterVkResample 1.0Upscale: 2x - Precision: Double1a232004006008001000SE +/- 3.58, N = 3SE +/- 3.72, N = 3SE +/- 4.35, N = 31004.391005.931007.991. (CXX) g++ options: -O3 -pthread

lzbench

Test: Brotli 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 0 - Process: Decompression231a120240360480600SE +/- 0.33, N = 3SE +/- 0.88, N = 35645625621. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Zstd Compression

Compression Level: 3

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 31a3230060090012001500SE +/- 1.09, N = 3SE +/- 5.21, N = 3SE +/- 3.28, N = 31611.21606.61605.61. (CC) gcc options: -O3 -pthread -lz -llzma

Monkey Audio Encoding

WAV To APE

OpenBenchmarking.orgSeconds, Fewer Is BetterMonkey Audio Encoding 3.99.6WAV To APE1a323691215SE +/- 0.04, N = 5SE +/- 0.07, N = 5SE +/- 0.04, N = 512.7712.7812.811. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C1a2312002400360048006000SE +/- 4.97, N = 3SE +/- 1.64, N = 3SE +/- 9.93, N = 35680.845669.095661.401. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

OpenBenchmarking.orgns/day, More Is BetterLAMMPS Molecular Dynamics Simulator 29Oct2020Model: Rhodopsin Protein31a21.12752.2553.38254.515.6375SE +/- 0.004, N = 3SE +/- 0.016, N = 3SE +/- 0.013, N = 35.0115.0024.9941. (CXX) g++ options: -O3 -pthread -lm

Cryptsetup

AES-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupAES-XTS 256b Encryption1a27001400210028003500SE +/- 1.88, N = 3SE +/- 20.76, N = 33387.63377.0

QuantLib

OpenBenchmarking.orgMFLOPS, More Is BetterQuantLib 1.21321a5001000150020002500SE +/- 17.07, N = 3SE +/- 14.18, N = 3SE +/- 18.57, N = 32187.12185.02180.51. (CXX) g++ options: -O3 -march=native -rdynamic

Cryptsetup

Serpent-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Encryption1a2160320480640800SE +/- 0.83, N = 3737.6735.5

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: alexnet21a510152025SE +/- 0.06, N = 3SE +/- 0.02, N = 321.0521.11MIN: 20.87 / MAX: 21.63MIN: 20.83 / MAX: 21.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.9.0Binary: Pathtracer - Model: Asian Dragon Obj321a246810SE +/- 0.0029, N = 3SE +/- 0.0153, N = 3SE +/- 0.0159, N = 37.36117.34187.3402MIN: 7.12 / MAX: 8.18MIN: 7.13 / MAX: 8.13MIN: 7.12 / MAX: 8.12

NCNN

Target: Vulkan GPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: vgg1621a306090120150SE +/- 0.06, N = 3SE +/- 0.12, N = 3113.72114.03MIN: 113.39 / MAX: 122.08MIN: 113.58 / MAX: 123.461. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.41a24M8M12M16M20MSE +/- 43251.00, N = 3SE +/- 22002.07, N = 317521143174778131. (CXX) g++ options: -O3 -fopenmp

Cryptsetup

Serpent-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Decryption1a2160320480640800SE +/- 0.38, N = 3SE +/- 0.94, N = 3723.9722.2

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Thorough1a21122334455SE +/- 0.50, N = 3SE +/- 0.49, N = 346.7846.891. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

FinanceBench

Benchmark: Bonds OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Bonds OpenMP1a215K30K45K60K75KSE +/- 217.85, N = 3SE +/- 331.16, N = 371041.9771207.271. (CXX) g++ options: -O3 -march=native -fopenmp

NCNN

Target: Vulkan GPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: resnet5021a1020304050SE +/- 0.05, N = 3SE +/- 0.07, N = 343.7143.81MIN: 41.74 / MAX: 46.8MIN: 41.64 / MAX: 53.861. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Basis Universal

Settings: ETC1S

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: ETC1S1a21326395265SE +/- 0.33, N = 3SE +/- 0.31, N = 359.9560.081. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

rav1e

Speed: 6

OpenBenchmarking.orgFrames Per Second, More Is Betterrav1e 0.4Speed: 61a320.32510.65020.97531.30041.6255SE +/- 0.002, N = 3SE +/- 0.001, N = 3SE +/- 0.005, N = 31.4451.4431.442

Etcpak

Configuration: ETC1

OpenBenchmarking.orgMpx/s, More Is BetterEtcpak 0.7Configuration: ETC11a3270140210280350SE +/- 0.06, N = 3SE +/- 0.33, N = 3SE +/- 0.37, N = 3299.46299.22298.891. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Zstd 1 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Zstd 1 - Process: Decompression321a30060090012001500SE +/- 1.76, N = 3SE +/- 2.89, N = 3SE +/- 3.61, N = 31610160716071. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

Target: Vulkan GPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: mobilenet21a612182430SE +/- 0.02, N = 3SE +/- 0.02, N = 327.5127.56MIN: 27.14 / MAX: 29.4MIN: 27.18 / MAX: 28.231. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

Serpent-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 512b Decryption1a2160320480640800SE +/- 0.50, N = 3SE +/- 1.30, N = 2723.8722.5

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: googlenet21a510152025SE +/- 0.35, N = 3SE +/- 0.27, N = 322.2522.29MIN: 21.23 / MAX: 23.53MIN: 21.06 / MAX: 23.61. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

Serpent-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupSerpent-XTS 256b Encryption1a2160320480640800SE +/- 0.66, N = 3SE +/- 0.32, N = 3736.3735.0

LZ4 Compression

Compression Level: 9 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 9 - Decompression Speed1a2313002600390052006500SE +/- 3.49, N = 3SE +/- 0.86, N = 3SE +/- 1.13, N = 35990.05983.25980.11. (CC) gcc options: -O3

Opus Codec Encoding

WAV To Opus Encode

OpenBenchmarking.orgSeconds, Fewer Is BetterOpus Codec Encoding 1.3.1WAV To Opus Encode21a33691215SE +/- 0.014, N = 5SE +/- 0.014, N = 5SE +/- 0.013, N = 59.6029.6069.6171. (CXX) g++ options: -fvisibility=hidden -logg -lm

lzbench

Test: Brotli 2 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Brotli 2 - Process: Decompression321a140280420560700SE +/- 0.58, N = 36516506501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

Twofish-XTS 256b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Decryption1a290180270360450SE +/- 0.15, N = 3SE +/- 0.34, N = 3404.0403.4

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: yolov4-tiny1a2918273645SE +/- 0.02, N = 3SE +/- 0.05, N = 340.9240.98MIN: 40.49 / MAX: 49.83MIN: 40.54 / MAX: 50.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve OpenMP - Degridding1a22004006008001000SE +/- 1.69, N = 3SE +/- 1.69, N = 31161.001159.321. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C321a3K6K9K12K15KSE +/- 16.88, N = 3SE +/- 11.20, N = 3SE +/- 6.60, N = 315474.0215459.4515451.701. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

ASTC Encoder

Preset: Fast

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Fast21a246810SE +/- 0.00, N = 3SE +/- 0.03, N = 37.127.131. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 95, Compression Effort 721a160320480640800SE +/- 1.84, N = 3SE +/- 0.77, N = 3723.52724.521. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

PHPBench

PHP Benchmark Suite

OpenBenchmarking.orgScore, More Is BetterPHPBench 0.8.1PHP Benchmark Suite1a2140K280K420K560K700KSE +/- 1381.24, N = 3SE +/- 1621.04, N = 3655165654272

CP2K Molecular Dynamics

Fayalite-FIST Data

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.1Fayalite-FIST Data1a23300600900120015001478.061478.481480.07

AI Benchmark Alpha

Device Inference Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference Score1a2160320480640800743742

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 2 + RDO Post-Processing21a2004006008001000SE +/- 0.31, N = 3SE +/- 0.28, N = 3807.29808.351. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: vgg1621a306090120150SE +/- 0.04, N = 3SE +/- 0.05, N = 3113.82113.97MIN: 113.45 / MAX: 122.83MIN: 113.51 / MAX: 123.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

Acceleration: CPU - Scene: Supercar

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Supercar21a0.56521.13041.69562.26082.826SE +/- 0.002, N = 3SE +/- 0.004, N = 32.5122.509

TNN

Target: CPU - Model: SqueezeNet v1.1

OpenBenchmarking.orgms, Fewer Is BetterTNN 0.2.3Target: CPU - Model: SqueezeNet v1.121a70140210280350SE +/- 0.09, N = 3SE +/- 0.07, N = 3343.21343.57MIN: 342.85 / MAX: 343.84MIN: 343.23 / MAX: 344.211. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: shufflenet-v2-10 - Device: OpenMP CPU1a23K6K9K12K15KSE +/- 22.47, N = 3SE +/- 34.58, N = 314156141411. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

dav1d

Video Input: Chimera 1080p 10-bit

OpenBenchmarking.orgFPS, More Is Betterdav1d 0.8.1Video Input: Chimera 1080p 10-bit231a20406080100SE +/- 0.08, N = 3SE +/- 0.11, N = 3SE +/- 0.04, N = 388.2488.2088.15MIN: 57.38 / MAX: 196.19MIN: 57.19 / MAX: 197.38MIN: 57.28 / MAX: 196.211. (CC) gcc options: -pthread

Cryptsetup

Twofish-XTS 512b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Encryption1a290180270360450SE +/- 0.12, N = 3SE +/- 0.31, N = 3400.9400.5

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Compression Effort 51a2510152025SE +/- 0.21, N = 8SE +/- 0.23, N = 721.3521.371. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: yolov4-tiny1a2918273645SE +/- 0.02, N = 3SE +/- 0.04, N = 340.9540.99MIN: 40.51 / MAX: 41.79MIN: 40.54 / MAX: 49.711. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

Test: tConvolve MT - Gridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Gridding21a130260390520650SE +/- 0.37, N = 3SE +/- 0.23, N = 3613.91613.321. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenFOAM

Input: Motorbike 30M

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 8Input: Motorbike 30M1a23100200300400500SE +/- 0.37, N = 3SE +/- 0.09, N = 3SE +/- 0.21, N = 3464.54464.94464.971. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

IndigoBench

Acceleration: CPU - Scene: Bedroom

OpenBenchmarking.orgM samples/s, More Is BetterIndigoBench 4.4Acceleration: CPU - Scene: Bedroom21a0.24530.49060.73590.98121.2265SE +/- 0.001, N = 3SE +/- 0.000, N = 31.0901.089

NAS Parallel Benchmarks

Test / Class: FT.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: FT.C231a16003200480064008000SE +/- 9.46, N = 3SE +/- 14.95, N = 3SE +/- 10.76, N = 37371.117369.787365.581. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz 2. Open MPI 4.0.3

Cryptsetup

Twofish-XTS 256b Encryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 256b Encryption1a290180270360450SE +/- 0.32, N = 3SE +/- 0.38, N = 3400.8400.5

Cryptsetup

Twofish-XTS 512b Decryption

OpenBenchmarking.orgMiB/s, More Is BetterCryptsetupTwofish-XTS 512b Decryption1a290180270360450SE +/- 0.15, N = 3SE +/- 0.40, N = 2403.7403.4

AI Benchmark Alpha

Device AI Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI Score1a23006009001200150014301429

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: squeezenet_ssd1a2714212835SE +/- 0.01, N = 3SE +/- 0.02, N = 328.6628.68MIN: 28.25 / MAX: 29.49MIN: 28.3 / MAX: 29.591. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Exhaustive1a280160240320400SE +/- 0.40, N = 3SE +/- 0.31, N = 3387.71387.971. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: UASTC Level 3

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 31a220406080100SE +/- 0.31, N = 3SE +/- 0.30, N = 395.5195.571. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASKAP

Test: tConvolve MT - Degridding

OpenBenchmarking.orgMillion Grid Points Per Second, More Is BetterASKAP 1.0Test: tConvolve MT - Degridding1a22004006008001000SE +/- 0.27, N = 3SE +/- 0.15, N = 3993.03992.411. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

OpenBenchmarking.orgSeconds, Fewer Is BetterWebP2 Image Encode 20210126Encode Settings: Quality 100, Lossless Compression1a230060090012001500SE +/- 1.95, N = 3SE +/- 1.82, N = 31445.401446.291. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Timed HMMer Search

Pfam Database Search

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed HMMer Search 3.3.1Pfam Database Search31a2306090120150SE +/- 0.01, N = 3SE +/- 0.07, N = 3SE +/- 0.02, N = 3131.24131.27131.311. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LZ4 Compression

Compression Level: 3 - Decompression Speed

OpenBenchmarking.orgMB/s, More Is BetterLZ4 Compression 1.9.3Compression Level: 3 - Decompression Speed1a3213002600390052006500SE +/- 1.02, N = 3SE +/- 0.40, N = 3SE +/- 1.62, N = 35983.55981.85980.31. (CC) gcc options: -O3

Cryptsetup

PBKDF2-sha512

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-sha5121a2300K600K900K1200K1500KSE +/- 801.33, N = 315887511587950

LULESH

OpenBenchmarking.orgz/s, More Is BetterLULESH 2.0.331a26001200180024003000SE +/- 1.38, N = 3SE +/- 2.52, N = 3SE +/- 1.49, N = 32739.312738.352738.011. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: alexnet21a510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 321.1121.12MIN: 20.86 / MAX: 21.73MIN: 20.84 / MAX: 22.351. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.21a3230M60M90M120M150MSE +/- 10051.26, N = 3SE +/- 8824.65, N = 3SE +/- 7169.92, N = 31223721671223344671223170331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

FinanceBench

Benchmark: Repo OpenMP

OpenBenchmarking.orgms, Fewer Is BetterFinanceBench 2016-07-25Benchmark: Repo OpenMP21a11K22K33K44K55KSE +/- 734.76, N = 3SE +/- 562.64, N = 349656.0749677.141. (CXX) g++ options: -O3 -march=native -fopenmp

Cryptsetup

PBKDF2-whirlpool

OpenBenchmarking.orgIterations Per Second, More Is BetterCryptsetupPBKDF2-whirlpool21a140K280K420K560K700KSE +/- 1574.42, N = 3SE +/- 2257.33, N = 3667606667331

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: mobilenet21a612182430SE +/- 0.02, N = 3SE +/- 0.03, N = 327.5427.55MIN: 27.2 / MAX: 30.25MIN: 27.17 / MAX: 28.551. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: Vulkan GPU - Model: squeezenet_ssd1a2714212835SE +/- 0.02, N = 3SE +/- 0.01, N = 328.6928.70MIN: 28.24 / MAX: 29.73MIN: 28.3 / MAX: 29.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: LPUSH

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: LPUSH1a2400K800K1200K1600K2000KSE +/- 8997.19, N = 3SE +/- 3103.90, N = 31718046.711717450.171. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Basis Universal

Settings: UASTC Level 0

OpenBenchmarking.orgSeconds, Fewer Is BetterBasis Universal 1.12Settings: UASTC Level 01a23691215SE +/- 0.005, N = 3SE +/- 0.005, N = 39.2289.2311. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Redis

Test: SET

OpenBenchmarking.orgRequests Per Second, More Is BetterRedis 6.0.9Test: SET21a400K800K1200K1600K2000KSE +/- 8366.29, N = 3SE +/- 17544.22, N = 31981355.171980808.831. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Google SynthMark

Test: VoiceMark_100

OpenBenchmarking.orgVoices, More Is BetterGoogle SynthMark 20201109Test: VoiceMark_10021a130260390520650SE +/- 0.18, N = 3SE +/- 0.20, N = 3615.80615.641. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WavPack Audio Encoding

WAV To WavPack

OpenBenchmarking.orgSeconds, Fewer Is BetterWavPack Audio Encoding 5.3WAV To WavPack21a48121620SE +/- 0.01, N = 5SE +/- 0.01, N = 516.6916.691. (CXX) g++ options: -rdynamic

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: resnet501a21020304050SE +/- 0.02, N = 3SE +/- 0.04, N = 343.6743.68MIN: 41.66 / MAX: 47.83MIN: 41.61 / MAX: 45.341. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

AI Benchmark Alpha

Device Training Score

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training Score21a150300450600750687687

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: fcn-resnet101-11 - Device: OpenMP CPU21a1020304050SE +/- 0.17, N = 3SE +/- 0.00, N = 345451. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: bertsquad-10 - Device: OpenMP CPU21a90180270360450SE +/- 1.15, N = 3SE +/- 1.26, N = 33973971. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.6Model: yolov4 - Device: OpenMP CPU21a60120180240300SE +/- 0.17, N = 3SE +/- 0.33, N = 32612611. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASTC Encoder

Preset: Medium

OpenBenchmarking.orgSeconds, Fewer Is BetterASTC Encoder 2.0Preset: Medium1a2246810SE +/- 0.00, N = 3SE +/- 0.00, N = 36.236.231. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Zstd Compression

Compression Level: 19

OpenBenchmarking.orgMB/s, More Is BetterZstd Compression 1.4.5Compression Level: 19321a3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 312.512.512.51. (CC) gcc options: -O3 -pthread -lz -llzma

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: PartialTweets321a0.1350.270.4050.540.675SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.60.60.61. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: LargeRandom321a0.08330.16660.24990.33320.4165SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.370.370.371. (CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 0.7.1Throughput Test: Kostya321a0.1080.2160.3240.4320.54SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.480.480.481. (CXX) g++ options: -O3 -pthread

lzbench

Test: Crush 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: Crush 0 - Process: Decompression321a1002003004005004504504501. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Decompression321a20406080100SE +/- 0.33, N = 31051051051. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

OpenBenchmarking.orgMB/s, More Is Betterlzbench 1.8Test: XZ 0 - Process: Compression321a9182736453838381. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Warsow

Resolution: 1920 x 1080

OpenBenchmarking.orgFrames Per Second, More Is BetterWarsow 2.5 BetaResolution: 1920 x 108011428425670SE +/- 0.37, N = 362.9

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap150100150200250SE +/- 0.86, N = 3209.31MIN: 54.45 / MAX: 280.91. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

OpenBenchmarking.orgFrames Per Second, More Is BetterDDraceNetwork 15.2.3Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2120406080100SE +/- 0.23, N = 392.07MIN: 29.47 / MAX: 1231. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20201218Target: CPU - Model: regnety_400m1a2510152025SE +/- 0.18, N = 3SE +/- 0.77, N = 317.8819.34MIN: 17.22 / MAX: 18.84MIN: 17.15 / MAX: 29.771. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU231a246810SE +/- 0.13886, N = 12SE +/- 0.15309, N = 12SE +/- 0.06721, N = 157.669427.814828.29113MIN: 5.68MIN: 5.76MIN: 7.11. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.0Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU231a0.65361.30721.96082.61443.268SE +/- 0.04605, N = 12SE +/- 0.04568, N = 12SE +/- 0.03413, N = 142.625102.657252.90509MIN: 2.08MIN: 2.1MIN: 2.261. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.4