ubuntu-2010-onlogic

Intel Xeon E-2278GEL testing with a Logic Supply RXM-181 (Z01-0001A027 BIOS) and Intel UHD P630 3GB on Ubuntu 20.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2102011-HA-UBUNTU20190&rdt&grs.

ubuntu-2010-onlogic
Configurations: 1, 1a, 2, 3

Processor: Intel Xeon E-2278GEL @ 3.90GHz (8 Cores / 16 Threads)
Motherboard: Logic Supply RXM-181 (Z01-0001A027 BIOS)
Chipset: Intel Cannon Lake PCH
Memory: 16GB
Disk: 512GB TS512GMTE510T
Graphics: Intel UHD P630 3GB (1150MHz)
Audio: Realtek ALC233
Monitor: DELL P2415Q
Network: Intel I219-LM + 2 x Intel I210
OS: Ubuntu 20.10
Kernel: 5.8.0-41-generic (x86_64)
Desktop: GNOME Shell 3.38.2
Display Server: X Server 1.20.9
Display Driver: intel
OpenGL: 4.6 Mesa 20.2.6
Vulkan: 1.2.145
Compiler: GCC 10.2.0
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details
- Transparent Huge Pages: madvise

Compiler Details
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,brig,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-targets=nvptx-none=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-10-JvwpWM/gcc-10-10.2.0/debian/tmp-gcn/usr,hsa --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
- Scaling Governor: intel_pstate powersave
- CPU Microcode: 0xde
- Thermald 2.3

Python Details
- Python 3.8.6

Security Details
- itlb_multihit: KVM: Mitigation of VMX disabled
- l1tf: Not affected
- mds: Not affected
- meltdown: Not affected
- spec_store_bypass: Mitigation of SSB disabled via prctl and seccomp
- spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization
- spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling
- srbds: Mitigation of TSX disabled
- tsx_async_abort: Mitigation of TSX disabled

Result overview table (configurations 1, 1a, 2, 3); individual test results are listed below.

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap - Total Frame Time

DDraceNetwork 15.2.3 (Milliseconds, Fewer Is Better)
1: Min: 3.69 / Avg: 4.77 / Max: 18.37
1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2 - Total Frame Time

DDraceNetwork 15.2.3 (Milliseconds, Fewer Is Better)
1: Min: 8.52 / Avg: 10.85 / Max: 18.85
1. (CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

Redis

Test: LPOP

Redis 6.0.9 (Requests Per Second, More Is Better)
1a: 2827879.75 (SE +/- 6710.43, N = 3)
2: 1729976.59 (SE +/- 6466.66, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3
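
The LPOP result shows the widest gap between two runs in this export. As a rough illustration only, the sketch below computes the percent change and how many combined standard errors separate the two values. It assumes the two figures belong to runs 1a and 2 in the order they are listed; the helper names `relative_change_percent` and `gap_in_standard_errors` are hypothetical and not part of the Phoronix Test Suite.

```python
import math


def relative_change_percent(baseline: float, other: float) -> float:
    """Percent change of `other` relative to `baseline`."""
    return (other - baseline) / baseline * 100.0


def gap_in_standard_errors(a: float, se_a: float, b: float, se_b: float) -> float:
    """Absolute difference divided by the combined standard error."""
    return abs(a - b) / math.sqrt(se_a ** 2 + se_b ** 2)


# Values copied from the Redis LPOP result above.
# Assumption: the first value is run 1a, the second is run 2.
lpop_1a, se_1a = 2827879.75, 6710.43
lpop_2, se_2 = 1729976.59, 6466.66

change = relative_change_percent(lpop_1a, lpop_2)
gap = gap_in_standard_errors(lpop_1a, se_1a, lpop_2, se_2)

print(f"Run 2 vs run 1a: {change:.1f}% requests per second")
print(f"Difference spans about {gap:.0f} combined standard errors")
```

A gap of many standard errors suggests the difference is not run-to-run noise, although it says nothing about the cause.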

oneDNN

Harness: IP Shapes 1D - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 8.52785 (SE +/- 0.11268, N = 3; MIN: 8.22)
2: 6.85206 (SE +/- 0.07900, N = 15; MIN: 6.11)
3: 6.64536 (SE +/- 0.04844, N = 15; MIN: 5.91)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 10.26870 (SE +/- 0.04090, N = 3; MIN: 10.12)
2: 8.13049 (SE +/- 0.02351, N = 3; MIN: 8.05)
3: 8.71496 (SE +/- 0.13403, N = 3; MIN: 8.53)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 5.31939 (SE +/- 0.03383, N = 3; MIN: 5.1)
2: 4.61762 (SE +/- 0.01117, N = 3; MIN: 4.49)
3: 4.66944 (SE +/- 0.02611, N = 3; MIN: 4.56)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 4.41011 (SE +/- 0.03942, N = 3; MIN: 4.14)
2: 3.99059 (SE +/- 0.00961, N = 3; MIN: 3.92)
3: 4.10228 (SE +/- 0.02669, N = 3; MIN: 3.98)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Redis

Test: GET

Redis 6.0.9 (Requests Per Second, More Is Better)
1a: 2703703.33 (SE +/- 17672.66, N = 3)
2: 2565612.75 (SE +/- 10642.86, N = 3)
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 7.59545 (SE +/- 0.02011, N = 3; MIN: 7.47)
2: 7.23123 (SE +/- 0.02083, N = 3; MIN: 7.1)
3: 7.20852 (SE +/- 0.00510, N = 3; MIN: 7.12)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

Test / Class: EP.C

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
1a: 1017.82 (SE +/- 16.69, N = 3)
2: 983.13 (SE +/- 1.43, N = 3)
3: 1032.05 (SE +/- 1.30, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

NCNN

Target: Vulkan GPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 (ms, Fewer Is Better)
1a: 5.78 (SE +/- 0.03, N = 3; MIN: 5.59 / MAX: 6.95)
2: 6.04 (SE +/- 0.19, N = 3; MIN: 5.6 / MAX: 7.71)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: regnety_400m

NCNN 20201218 (ms, Fewer Is Better)
1a: 17.90 (SE +/- 0.22, N = 3; MIN: 17.15 / MAX: 18.95)
2: 18.61 (SE +/- 0.08, N = 3; MIN: 17.25 / MAX: 20)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 33.32 (SE +/- 0.39, N = 3; MIN: 32.4)
2: 32.14 (SE +/- 0.09, N = 3; MIN: 31.94)
3: 32.12 (SE +/- 0.08, N = 3; MIN: 31.91)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

dav1d

Video Input: Chimera 1080p

dav1d 0.8.1 (FPS, More Is Better)
1a: 436.17 (SE +/- 0.72, N = 3; MIN: 338.67 / MAX: 653.63)
2: 446.95 (SE +/- 1.54, N = 3; MIN: 335.55 / MAX: 660.93)
3: 451.88 (SE +/- 1.05, N = 3; MIN: 337.72 / MAX: 664.93)
1. (CC) gcc options: -pthread

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 3508.55 (SE +/- 12.64, N = 3; MIN: 3442.98)
2: 3402.40 (SE +/- 1.31, N = 3; MIN: 3364.88)
3: 3387.30 (SE +/- 6.49, N = 3; MIN: 3338.56)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

CLOMP

Static OMP Speedup

CLOMP 1.2 (Speedup, More Is Better)
1a: 2.9 (SE +/- 0.03, N = 3)
2: 3.0 (SE +/- 0.03, N = 9)
3: 3.0 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -fopenmp -O3 -lm

Numpy Benchmark

Numpy Benchmark (Score, More Is Better)
1a: 329.04 (SE +/- 0.25, N = 3)
2: 339.74 (SE +/- 0.16, N = 3)
3: 328.97 (SE +/- 0.58, N = 3)

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 3472.52 (SE +/- 42.16, N = 3; MIN: 3371.23)
2: 3402.03 (SE +/- 7.92, N = 3; MIN: 3340.17)
3: 3363.67 (SE +/- 1.67, N = 3; MIN: 3322.79)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NCNN

Target: Vulkan GPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218 (ms, Fewer Is Better)
1a: 7.76 (SE +/- 0.03, N = 3; MIN: 7.45 / MAX: 9.82)
2: 8.00 (SE +/- 0.14, N = 3; MIN: 7.65 / MAX: 9.88)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: blazeface

NCNN 20201218 (ms, Fewer Is Better)
1a: 2.44 (SE +/- 0.06, N = 3; MIN: 2.28 / MAX: 2.98)
2: 2.51 (SE +/- 0.01, N = 3; MIN: 2.35 / MAX: 3.03)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: shufflenet-v2

NCNN 20201218 (ms, Fewer Is Better)
1a: 6.64 (SE +/- 0.09, N = 3; MIN: 6.5 / MAX: 8.16)
2: 6.83 (SE +/- 0.11, N = 3; MIN: 6.65 / MAX: 7.98)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: shufflenet-v2

NCNN 20201218 (ms, Fewer Is Better)
1a: 6.67 (SE +/- 0.08, N = 3; MIN: 6.51 / MAX: 8.11)
2: 6.86 (SE +/- 0.18, N = 3; MIN: 6.5 / MAX: 8.1)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

eSpeak-NG Speech Engine

Text-To-Speech Synthesis

eSpeak-NG Speech Engine 20200907 (Seconds, Fewer Is Better)
1a: 30.82 (SE +/- 0.44, N = 4)
2: 31.64 (SE +/- 0.22, N = 20)
3: 30.96 (SE +/- 0.19, N = 4)
1. (CC) gcc options: -O2 -std=c99

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 3469.63 (SE +/- 27.44, N = 3; MIN: 3389.1)
2: 3382.74 (SE +/- 13.00, N = 3; MIN: 3319.45)
3: 3382.38 (SE +/- 14.91, N = 3; MIN: 3321.67)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

NAS Parallel Benchmarks

Test / Class: EP.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
1a: 870.70 (SE +/- 10.93, N = 12)
2: 860.27 (SE +/- 8.64, N = 12)
3: 881.34 (SE +/- 13.60, N = 3)
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 31.78 (SE +/- 0.18, N = 3; MIN: 31.28)
2: 31.03 (SE +/- 0.06, N = 3; MIN: 30.81)
3: 31.03 (SE +/- 0.04, N = 3; MIN: 30.84)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Timed Eigen Compilation

Time To Compile

Timed Eigen Compilation 3.3.9 (Seconds, Fewer Is Better)
1a: 89.91 (SE +/- 0.01, N = 3)
2: 87.79 (SE +/- 0.03, N = 3)
3: 89.59 (SE +/- 0.11, N = 3)

Timed FFmpeg Compilation

Time To Compile

Timed FFmpeg Compilation 4.2.2 (Seconds, Fewer Is Better)
1a: 86.59 (SE +/- 0.42, N = 3)
2: 88.43 (SE +/- 1.19, N = 3)
3: 87.36 (SE +/- 0.24, N = 3)

oneDNN

Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 6006.34 (SE +/- 16.93, N = 3; MIN: 5936.3)
2: 5899.12 (SE +/- 3.24, N = 3; MIN: 5840.53)
3: 5884.76 (SE +/- 6.65, N = 3; MIN: 5829.64)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Cython Benchmark

Test: N-Queens

Cython Benchmark 0.29.21 (Seconds, Fewer Is Better)
1a: 25.82 (SE +/- 0.03, N = 3)
2: 26.33 (SE +/- 0.25, N = 15)
3: 25.83 (SE +/- 0.09, N = 3)

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 4.64891 (SE +/- 0.04669, N = 8; MIN: 3.94)
2: 4.55996 (SE +/- 0.04208, N = 10; MIN: 3.76)
3: 4.57050 (SE +/- 0.04478, N = 9; MIN: 3.81)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Stockfish

Total Time

Stockfish 12 (Nodes Per Second, More Is Better)
1a: 11763649 (SE +/- 37547.46, N = 3)
2: 11662466 (SE +/- 47603.72, N = 3)
3: 11884731 (SE +/- 160953.33, N = 4)
1. (CXX) g++ options: -m64 -lpthread -fno-exceptions -std=c++17 -pedantic -O3 -msse -msse3 -mpopcnt -msse4.1 -mssse3 -msse2 -flto -flto=jobserver

asmFish

1024 Hash Memory, 26 Depth

asmFish 2018-07-23 (Nodes/second, More Is Better)
1a: 18947598 (SE +/- 239170.10, N = 3)
2: 18657804 (SE +/- 47854.09, N = 3)
3: 18601604 (SE +/- 209621.37, N = 3)

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 6000.83 (SE +/- 19.70, N = 3; MIN: 5915.81)
2: 5902.68 (SE +/- 13.63, N = 3; MIN: 5841.04)
3: 5893.19 (SE +/- 4.88, N = 3; MIN: 5838.51)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 6004.06 (SE +/- 18.98, N = 3; MIN: 5933.15)
2: 5896.89 (SE +/- 6.73, N = 3; MIN: 5837.81)
3: 5898.04 (SE +/- 12.51, N = 3; MIN: 5828.75)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

VkFFT

VkFFT 1.1.1 (Benchmark Score, More Is Better)
1: 1273
1a: 1296
2: 1285
3: 1287
SE +/- 1.76, N = 3; SE +/- 0.67, N = 3; SE +/- 0.58, N = 3 (three entries listed for four results)
1. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU - Model: efficientnet-b0

NCNN 20201218 (ms, Fewer Is Better)
1a: 9.98 (SE +/- 0.14, N = 3; MIN: 9.65 / MAX: 19.45)
2: 10.15 (SE +/- 0.02, N = 3; MIN: 9.86 / MAX: 10.88)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Crafty

Elapsed Time

Crafty 25.2 (Nodes Per Second, More Is Better)
1a: 7355877 (SE +/- 25940.20, N = 3)
2: 7462464 (SE +/- 26469.09, N = 3)
3: 7480933 (SE +/- 8686.26, N = 3)
1. (CC) gcc options: -pthread -lstdc++ -fprofile-use -lm

simdjson

Throughput Test: DistinctUserID

simdjson 0.7.1 (GB/s, More Is Better)
1a: 0.61 (SE +/- 0.00, N = 3)
2: 0.61 (SE +/- 0.00, N = 3)
3: 0.62 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -pthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

NCNN 20201218 (ms, Fewer Is Better)
1a: 5.77 (SE +/- 0.03, N = 3; MIN: 5.57 / MAX: 7.01)
2: 5.86 (SE +/- 0.04, N = 3; MIN: 5.62 / MAX: 7.63)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

VKMark

Resolution: 1920 x 1080

VKMark 2020-05-21 (VKMark Score, More Is Better)
1a: 525 (SE +/- 2.40, N = 3)
2: 518 (SE +/- 0.58, N = 3)
3: 517 (SE +/- 0.58, N = 3)
1. (CXX) g++ options: -pthread -ldl -pipe -std=c++14 -MD -MQ -MF

oneDNN

Harness: IP Shapes 3D - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 18.36 (SE +/- 0.15, N = 3; MIN: 17.58)
2: 18.38 (SE +/- 0.04, N = 3; MIN: 17.99)
3: 18.10 (SE +/- 0.05, N = 3; MIN: 17.7)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

Quantum ESPRESSO

Input: AUSURF112

Quantum ESPRESSO 6.7 (Seconds, Fewer Is Better)
1a: 2191.15 (SE +/- 20.55, N = 3)
2: 2164.20 (SE +/- 13.36, N = 3)
3: 2158.80 (SE +/- 6.30, N = 3)
1. (F9X) gfortran options: -lopenblas -lFoX_dom -lFoX_sax -lFoX_wxml -lFoX_common -lFoX_utils -lFoX_fsys -lfftw3 -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz

dav1d

Video Input: Summer Nature 1080p

dav1d 0.8.1 (FPS, More Is Better)
1a: 427.62 (SE +/- 0.99, N = 3; MIN: 366.19 / MAX: 462)
2: 433.95 (SE +/- 0.48, N = 3; MIN: 372.19 / MAX: 467.89)
3: 433.44 (SE +/- 1.00, N = 3; MIN: 370.03 / MAX: 468.99)
1. (CC) gcc options: -pthread

ONNX Runtime

Model: super-resolution-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
1a: 3243 (SE +/- 6.71, N = 3)
2: 3197 (SE +/- 34.43, N = 3)
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

BRL-CAD

VGR Performance Metric

BRL-CAD 7.30.8 (VGR Performance Metric, More Is Better)
1a: 68561
2: 69488
1. (CXX) g++ options: -std=c++11 -pipe -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -pedantic -rdynamic -lSM -lICE -lXi -lGLU -lGL -lGLdispatch -lX11 -lXext -lXrender -lpthread -ldl -luuid -lm

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 7.99169 (SE +/- 0.12949, N = 12; MIN: 6.38)
2: 7.88645 (SE +/- 0.12374, N = 12; MIN: 6.33)
3: 7.93690 (SE +/- 0.12812, N = 12; MIN: 6.33)
1. (CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

CloverLeaf

Lagrangian-Eulerian Hydrodynamics

CloverLeaf (Seconds, Fewer Is Better)
1a: 300.05 (SE +/- 0.03, N = 3)
2: 303.97 (SE +/- 0.10, N = 3)
3: 303.67 (SE +/- 0.08, N = 3)
1. (F9X) gfortran options: -O3 -march=native -funroll-loops -fopenmp

QMCPACK

Input: simple-H2O

QMCPACK 3.10 (Total Execution Time - Seconds, Fewer Is Better)
1a: 38.53 (SE +/- 0.55, N = 4)
2: 38.05 (SE +/- 0.63, N = 3)
3: 38.45 (SE +/- 0.55, N = 4)
1. (CXX) g++ options: -fopenmp -finline-limit=1000 -fstrict-aliasing -funroll-all-loops -march=native -O3 -fomit-frame-pointer -ffast-math -pthread -lm

SQLite Speedtest

Timed Time - Size 1,000

SQLite Speedtest 3.30 (Seconds, Fewer Is Better)
1a: 62.03 (SE +/- 0.23, N = 3)
2: 61.25 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -O2 -ldl -lz -lpthread

lzbench

Test: Zstd 8 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
1a: 81
2: 82
3: 82
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

Target: CPU - Model: mnasnet

NCNN 20201218 (ms, Fewer Is Better)
1a: 5.77 (SE +/- 0.10, N = 3; MIN: 5.35 / MAX: 7.36)
2: 5.84 (SE +/- 0.10, N = 3; MIN: 5.28 / MAX: 7.04)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LZ4 Compression

Compression Level: 1 - Compression Speed

LZ4 Compression 1.9.3 (MB/s, More Is Better)
1a: 5218.11 (SE +/- 1.32, N = 3)
2: 5224.18 (SE +/- 3.55, N = 3)
3: 5162.57 (SE +/- 4.86, N = 3)
1. (CC) gcc options: -O3

lzbench

Test: Brotli 2 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
1a: 168
2: 170
3: 169
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

ASKAP

Test: tConvolve OpenMP - Gridding

ASKAP 1.0 (Million Grid Points Per Second, More Is Better)
1a: 690.02 (SE +/- 9.06, N = 3)
2: 682.73 (SE +/- 2.68, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Unpacking Firefox

Extracting: firefox-84.0.source.tar.xz

Unpacking Firefox 84.0 (Seconds, Fewer Is Better)
1a: 21.41 (SE +/- 0.24, N = 7)
2: 21.64 (SE +/- 0.32, N = 4)

rav1e

Speed: 10

rav1e 0.4 (Frames Per Second, More Is Better)
1a: 3.158 (SE +/- 0.004, N = 3)
2: 3.187 (SE +/- 0.027, N = 3)
3: 3.154 (SE +/- 0.002, N = 3)

NCNN

Target: Vulkan GPU - Model: mnasnet

NCNN 20201218 (ms, Fewer Is Better)
1a: 5.76 (SE +/- 0.11, N = 3; MIN: 5.31 / MAX: 7.49)
2: 5.82 (SE +/- 0.12, N = 3; MIN: 5.3 / MAX: 7.26)
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

TNN

Target: CPU - Model: MobileNet v2

TNN 0.2.3 (ms, Fewer Is Better)
1a: 372.04 (SE +/- 2.01, N = 3; MIN: 368.17 / MAX: 376.4)
2: 368.30 (SE +/- 0.47, N = 3; MIN: 366.89 / MAX: 369.96)
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

lzbench

Test: Crush 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
1a: 101
2: 102
3: 101
SE +/- 0.33, N = 3; SE +/- 1.00, N = 3 (two entries listed for three results)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Mobile Neural Network

Model: SqueezeNetV1.0

Mobile Neural Network 1.1.1 (ms, Fewer Is Better)
1a: 7.977 (SE +/- 0.010, N = 3; MIN: 7.4 / MAX: 21.28)
2: 7.901 (SE +/- 0.038, N = 3; MIN: 7.36 / MAX: 20.81)
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Embree

Binary: Pathtracer - Model: Crown

Embree 3.9.0 (Frames Per Second, More Is Better)
1a: 6.9571 (SE +/- 0.0934, N = 4; MIN: 6.68 / MAX: 8.51)
2: 6.9272 (SE +/- 0.1026, N = 4; MIN: 6.63 / MAX: 8.55)
3: 6.8925 (SE +/- 0.0890, N = 5; MIN: 6.62 / MAX: 8.55)

rav1e

Speed: 5

rav1e 0.4 (Frames Per Second, More Is Better)
1a: 1.096 (SE +/- 0.004, N = 3)
2: 1.102 (SE +/- 0.003, N = 3)
3: 1.092 (SE +/- 0.001, N = 3)

WebP2 Image Encode

Encode Settings: Default

WebP2 Image Encode 20210126 (Seconds, Fewer Is Better)
1a: 5.516 (SE +/- 0.029, N = 3)
2: 5.491 (SE +/- 0.020, N = 3)
3: 5.466 (SE +/- 0.030, N = 3)
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

ASKAP

Test: tConvolve MPI - Gridding

ASKAP 1.0 (Mpix/sec, More Is Better)
1a: 1155.97 (SE +/- 5.07, N = 3)
2: 1146.11 (SE +/- 12.47, N = 3)
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Basis Universal

Settings: UASTC Level 2

Basis Universal 1.12 (Seconds, Fewer Is Better)
1a: 48.59 (SE +/- 0.69, N = 4)
2: 48.19 (SE +/- 0.77, N = 3)
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4: Total Mop/s, more is better
Results: 1a = 2905.81, 2 = 2881.77, 3 = 2901.19
SE +/- 1.91, N = 3; SE +/- 0.89, N = 3; SE +/- 2.44, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3
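
As a rough aid to reading the CG.C numbers, the gap between the fastest and slowest run can be expressed as a percentage of the slowest. The sketch below is illustrative only: it assumes the three values map to runs 1a, 2 and 3 in the order they appear in the exported graph, and the helper name `relative_spread_percent` is hypothetical, not part of the Phoronix Test Suite.

```python
def relative_spread_percent(results):
    """Return (max - min) / min * 100 for a mapping of run label -> result."""
    values = list(results.values())
    lowest, highest = min(values), max(values)
    return (highest - lowest) / lowest * 100


# CG.C results in Total Mop/s, taken from the graph above.
# Assumption: values are listed in the same order as the run labels.
cg_c = {"1a": 2905.81, "2": 2881.77, "3": 2901.19}

print(f"CG.C spread across runs: {relative_spread_percent(cg_c):.2f}%")
```

For these three runs the spread comes out below 1%, which is the kind of run-to-run variation seen in most of the results on this page.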

dav1d

Video Input: Summer Nature 4K

dav1d 0.8.1: FPS, more is better
Results: 1a = 101.12, 2 = 100.75, 3 = 101.58
SE +/- 0.28, N = 3; SE +/- 0.37, N = 3; SE +/- 0.36, N = 3
MIN: 91.52 / MAX: 106.08; MIN: 83.92 / MAX: 106.22; MIN: 87.94 / MAX: 106.99
1. (CC) gcc options: -pthread

Build2

Time To Compile

Build2 0.13: Seconds, fewer is better
Results: 1a = 231.16, 2 = 230.34, 3 = 232.22
SE +/- 0.90, N = 3; SE +/- 1.08, N = 3; SE +/- 1.00, N = 3

NCNN

Target: Vulkan GPU - Model: blazeface

NCNN 20201218: ms, fewer is better
Results: 1a = 2.45, 2 = 2.47
SE +/- 0.06, N = 3; SE +/- 0.05, N = 3
MIN: 2.29 / MAX: 2.85; MIN: 2.31 / MAX: 2.77
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression 1.9.3: MB/s, more is better
Results: 1a = 44.53, 2 = 44.84, 3 = 44.88
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.00, N = 3
1. (CC) gcc options: -O3

rav1e

Speed: 1

rav1e 0.4: Frames Per Second, more is better
Results: 1a = 0.386, 2 = 0.385, 3 = 0.383
SE +/- 0.001, N = 3; SE +/- 0.001, N = 3; SE +/- 0.000, N = 3

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 3.9.0: Frames Per Second, more is better
Results: 1a = 7.6874, 2 = 7.7466, 3 = 7.7408
SE +/- 0.1307, N = 3; SE +/- 0.0480, N = 3; SE +/- 0.0611, N = 3
MIN: 7.27 / MAX: 9.52; MIN: 7.5 / MAX: 9.39; MIN: 7.48 / MAX: 9.45

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression 1.9.3: MB/s, more is better
Results: 1a = 43.44, 2 = 43.74, 3 = 43.76
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
1. (CC) gcc options: -O3

lzbench

Test: Brotli 0 - Process: Compression

lzbench 1.8: MB/s, more is better
Results: 1a = 416, 2 = 419, 3 = 417
SE +/- 0.33, N = 3 (reported for 1 of 3 runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Etcpak

Configuration: ETC2

Etcpak 0.7: Mpx/s, more is better
Results: 1a = 165.24, 2 = 164.08, 3 = 165.26
SE +/- 0.01, N = 3; SE +/- 0.90, N = 3; SE +/- 0.01, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

VkResample

Upscale: 2x - Precision: Single

VkResample 1.0: ms, fewer is better
Results: 1 = 501.91, 1a = 498.53, 2 = 500.91, 3 = 500.69
SE +/- 0.26, N = 3; SE +/- 0.33, N = 3; SE +/- 0.18, N = 3; SE +/- 0.25, N = 3
1. (CXX) g++ options: -O3 -pthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 3.9.0: Frames Per Second, more is better
Results: 1a = 8.9844, 2 = 8.9358, 3 = 8.9952
SE +/- 0.0549, N = 3; SE +/- 0.0413, N = 3; SE +/- 0.0858, N = 3
MIN: 8.77 / MAX: 9.8; MIN: 8.75 / MAX: 9.83; MIN: 8.75 / MAX: 9.95

ASKAP

Test: Hogbom Clean OpenMP

ASKAP 1.0: Iterations Per Second, more is better
Results: 1a = 104.64, 2 = 103.95
SE +/- 0.26, N = 3; SE +/- 0.06, N = 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

LZ4 Compression

Compression Level: 1 - Decompression Speed

LZ4 Compression 1.9.3: MB/s, more is better
Results: 1a = 6003.6, 2 = 6043.1, 3 = 6034.7
SE +/- 5.95, N = 3; SE +/- 0.40, N = 3; SE +/- 2.77, N = 3
1. (CC) gcc options: -O3

Gcrypt Library

Gcrypt Library 1.9: Seconds, fewer is better
Results: 1a = 229.82, 2 = 229.31, 3 = 230.80
SE +/- 0.49, N = 3; SE +/- 0.19, N = 3; SE +/- 0.52, N = 3
1. (CC) gcc options: -O2 -fvisibility=hidden

Mobile Neural Network

Model: resnet-v2-50

Mobile Neural Network 1.1.1: ms, fewer is better
Results: 1a = 42.47, 2 = 42.21
SE +/- 0.06, N = 3; SE +/- 0.03, N = 3
MIN: 41.07 / MAX: 45.74; MIN: 40.94 / MAX: 44.99
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Cryptsetup

AES-XTS 512b Encryption

Cryptsetup: MiB/s, more is better
Results: 1a = 2784.8, 2 = 2767.8
SE +/- 4.28, N = 3; SE +/- 12.45, N = 3

Redis

Test: SADD

Redis 6.0.9: Requests Per Second, more is better
Results: 1a = 2261333.33, 2 = 2247597.67
SE +/- 6947.45, N = 3; SE +/- 3348.75, N = 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3: Seconds, fewer is better
Results: 1a = 228.93, 2 = 228.76, 3 = 227.56
SE +/- 1.32, N = 3; SE +/- 0.12, N = 3; SE +/- 0.14, N = 3

NCNN

Target: Vulkan GPU - Model: efficientnet-b0

NCNN 20201218: ms, fewer is better
Results: 1a = 10.00, 2 = 10.06
SE +/- 0.13, N = 3; SE +/- 0.14, N = 3
MIN: 9.67 / MAX: 11.41; MIN: 9.74 / MAX: 19.59
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

AES-XTS 256b Decryption

Cryptsetup: MiB/s, more is better
Results: 1a = 3394.0, 2 = 3373.8
SE +/- 8.42, N = 3; SE +/- 18.72, N = 3

Mobile Neural Network

Model: MobileNetV2_224

Mobile Neural Network 1.1.1: ms, fewer is better
Results: 1a = 4.616, 2 = 4.589
SE +/- 0.014, N = 3; SE +/- 0.008, N = 3
MIN: 4 / MAX: 5.38; MIN: 4 / MAX: 5.39
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Mobile Neural Network

Model: inception-v3

Mobile Neural Network 1.1.1: ms, fewer is better
Results: 1a = 48.21, 2 = 47.93
SE +/- 1.11, N = 3; SE +/- 1.29, N = 3
MIN: 40.03 / MAX: 87.29; MIN: 39.27 / MAX: 62.11
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Timed MAFFT Alignment

Multiple Sequence Alignment - LSU RNA

Timed MAFFT Alignment 7.471: Seconds, fewer is better
Results: 1a = 12.59, 2 = 12.61, 3 = 12.54
SE +/- 0.02, N = 3; SE +/- 0.10, N = 3; SE +/- 0.03, N = 3
1. (CC) gcc options: -std=c99 -O3 -lm -lpthread

NCNN

Target: Vulkan GPU - Model: resnet18

NCNN 20201218: ms, fewer is better
Results: 1a = 24.28, 2 = 24.15
SE +/- 0.02, N = 3; SE +/- 0.07, N = 3
MIN: 23.77 / MAX: 32.56; MIN: 23.68 / MAX: 25.72
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

AES-XTS 512b Decryption

Cryptsetup: MiB/s, more is better
Results: 1a = 2783.6, 2 = 2769.2
SE +/- 4.37, N = 3; SE +/- 10.39, N = 3

lzbench

Test: Zstd 8 - Process: Decompression

lzbench 1.8: MB/s, more is better
Results: 1a = 1743, 2 = 1748, 3 = 1739
SE +/- 11.50, N = 3; SE +/- 8.19, N = 3 (reported for 2 of 3 runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Embree

Binary: Pathtracer - Model: Asian Dragon

Embree 3.9.0: Frames Per Second, more is better
Results: 1a = 7.9911, 2 = 7.9786, 3 = 8.0194
SE +/- 0.0156, N = 3; SE +/- 0.0266, N = 3; SE +/- 0.0068, N = 3
MIN: 7.78 / MAX: 9.04; MIN: 7.81 / MAX: 8.99; MIN: 7.79 / MAX: 9.01

ASKAP

Test: tConvolve MPI - Degridding

ASKAP 1.0: Mpix/sec, more is better
Results: 1a = 1284.26, 2 = 1278.11
SE +/- 8.33, N = 3; SE +/- 11.46, N = 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

Etcpak

Configuration: DXT1

Etcpak 0.7: Mpx/s, more is better
Results: 1a = 1208.89, 2 = 1203.15, 3 = 1208.66
SE +/- 1.99, N = 3; SE +/- 0.69, N = 3; SE +/- 1.26, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Libdeflate 1 - Process: Compression

lzbench 1.8: MB/s, more is better
Results: 1a = 213, 2 = 214, 3 = 214
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

Target: Vulkan GPU - Model: googlenet

NCNN 20201218: ms, fewer is better
Results: 1a = 22.36, 2 = 22.26
SE +/- 0.23, N = 3; SE +/- 0.36, N = 3
MIN: 21.06 / MAX: 23.65; MIN: 21.15 / MAX: 23.73
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

lzbench

Test: Zstd 1 - Process: Compression

lzbench 1.8: MB/s, more is better
Results: 1a = 452, 2 = 451, 3 = 450
SE +/- 0.58, N = 3; SE +/- 0.67, N = 3; SE +/- 1.53, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

WebP2 Image Encode

Encode Settings: Quality 75, Compression Effort 7

WebP2 Image Encode 20210126: Seconds, fewer is better
Results: 1a = 392.31, 2 = 392.41, 3 = 394.01
SE +/- 1.12, N = 3; SE +/- 1.91, N = 3; SE +/- 1.58, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Mobile Neural Network

Model: mobilenet-v1-1.0

Mobile Neural Network 1.1.1: ms, fewer is better
Results: 1a = 4.096, 2 = 4.079
SE +/- 0.008, N = 3; SE +/- 0.012, N = 3
MIN: 3.86 / MAX: 17.27; MIN: 3.87 / MAX: 16.37
1. (CXX) g++ options: -std=c++11 -O3 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

Embree 3.9.0: Frames Per Second, more is better
Results: 1a = 7.9852, 2 = 7.9535, 3 = 7.9747
SE +/- 0.0235, N = 3; SE +/- 0.0122, N = 3; SE +/- 0.0138, N = 3
MIN: 7.77 / MAX: 8.68; MIN: 7.72 / MAX: 8.73; MIN: 7.77 / MAX: 8.72

Etcpak

Configuration: ETC1 + Dithering

Etcpak 0.7: Mpx/s, more is better
Results: 1a = 280.77, 2 = 279.66, 3 = 280.58
SE +/- 0.11, N = 3; SE +/- 0.99, N = 3; SE +/- 0.04, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

Node.js V8 Web Tooling Benchmark

Node.js V8 Web Tooling Benchmark: runs/s, more is better
Results: 1a = 10.39, 2 = 10.35
SE +/- 0.08, N = 3; SE +/- 0.11, N = 3
1. Nodejs v12.18.2

GROMACS

Water Benchmark

GROMACS 2020.3: Ns Per Day, more is better
Results: 1a = 0.519, 2 = 0.521
SE +/- 0.002, N = 3; SE +/- 0.001, N = 3
1. (CXX) g++ options: -O3 -pthread -lrt -lpthread -lm

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

NCNN 20201218: ms, fewer is better
Results: 1a = 7.81, 2 = 7.84
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3
MIN: 7.6 / MAX: 9.81; MIN: 7.59 / MAX: 9.75
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet18

NCNN 20201218: ms, fewer is better
Results: 1a = 24.25, 2 = 24.16
SE +/- 0.01, N = 3; SE +/- 0.09, N = 3
MIN: 23.71 / MAX: 26.65; MIN: 23.71 / MAX: 25.08
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Coremark

CoreMark Size 666 - Iterations Per Second

Coremark 1.0: Iterations/Sec, more is better
Results: 1a = 243609.12, 2 = 244321.48, 3 = 244509.64
SE +/- 1679.06, N = 14; SE +/- 2311.32, N = 10; SE +/- 2406.51, N = 9
1. (CC) gcc options: -O2 -lrt" -lrt

VkResample

Upscale: 2x - Precision: Double

VkResample 1.0: ms, fewer is better
Results: 1a = 1004.39, 2 = 1005.93, 3 = 1007.99
SE +/- 3.58, N = 3; SE +/- 3.72, N = 3; SE +/- 4.35, N = 3
1. (CXX) g++ options: -O3 -pthread

lzbench

Test: Brotli 0 - Process: Decompression

lzbench 1.8: MB/s, more is better
Results: 1a = 562, 2 = 564, 3 = 562
SE +/- 0.88, N = 3; SE +/- 0.33, N = 3 (reported for 2 of 3 runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Zstd Compression

Compression Level: 3

Zstd Compression 1.4.5: MB/s, more is better
Results: 1a = 1611.2, 2 = 1605.6, 3 = 1606.6
SE +/- 1.09, N = 3; SE +/- 3.28, N = 3; SE +/- 5.21, N = 3
1. (CC) gcc options: -O3 -pthread -lz -llzma

Monkey Audio Encoding

WAV To APE

Monkey Audio Encoding 3.99.6: Seconds, fewer is better
Results: 1a = 12.77, 2 = 12.81, 3 = 12.78
SE +/- 0.04, N = 5; SE +/- 0.04, N = 5; SE +/- 0.07, N = 5
1. (CXX) g++ options: -O3 -pedantic -rdynamic -lrt

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4: Total Mop/s, more is better
Results: 1a = 5680.84, 2 = 5669.09, 3 = 5661.40
SE +/- 4.97, N = 3; SE +/- 1.64, N = 3; SE +/- 9.93, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

LAMMPS Molecular Dynamics Simulator

Model: Rhodopsin Protein

LAMMPS Molecular Dynamics Simulator 29Oct2020: ns/day, more is better
Results: 1a = 5.002, 2 = 4.994, 3 = 5.011
SE +/- 0.016, N = 3; SE +/- 0.013, N = 3; SE +/- 0.004, N = 3
1. (CXX) g++ options: -O3 -pthread -lm

Cryptsetup

AES-XTS 256b Encryption

Cryptsetup: MiB/s, more is better
Results: 1a = 3387.6, 2 = 3377.0
SE +/- 1.88, N = 3; SE +/- 20.76, N = 3

QuantLib

QuantLib 1.21: MFLOPS, more is better
Results: 1a = 2180.5, 2 = 2185.0, 3 = 2187.1
SE +/- 18.57, N = 3; SE +/- 14.18, N = 3; SE +/- 17.07, N = 3
1. (CXX) g++ options: -O3 -march=native -rdynamic

Cryptsetup

Serpent-XTS 512b Encryption

Cryptsetup: MiB/s, more is better
Results: 1a = 737.6, 2 = 735.5
SE +/- 0.83, N = 3 (reported for 1 of 2 runs)

NCNN

Target: CPU - Model: alexnet

NCNN 20201218: ms, fewer is better
Results: 1a = 21.11, 2 = 21.05
SE +/- 0.02, N = 3; SE +/- 0.06, N = 3
MIN: 20.83 / MAX: 21.88; MIN: 20.87 / MAX: 21.63
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

Embree 3.9.0: Frames Per Second, more is better
Results: 1a = 7.3402, 2 = 7.3418, 3 = 7.3611
SE +/- 0.0159, N = 3; SE +/- 0.0153, N = 3; SE +/- 0.0029, N = 3
MIN: 7.12 / MAX: 8.12; MIN: 7.13 / MAX: 8.13; MIN: 7.12 / MAX: 8.18

NCNN

Target: Vulkan GPU - Model: vgg16

NCNN 20201218: ms, fewer is better
Results: 1a = 114.03, 2 = 113.72
SE +/- 0.12, N = 3; SE +/- 0.06, N = 3
MIN: 113.58 / MAX: 123.46; MIN: 113.39 / MAX: 122.08
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Kripke

Kripke 1.2.4: Throughput FoM, more is better
Results: 1a = 17521143, 2 = 17477813
SE +/- 43251.00, N = 3; SE +/- 22002.07, N = 3
1. (CXX) g++ options: -O3 -fopenmp

Cryptsetup

Serpent-XTS 256b Decryption

Cryptsetup: MiB/s, more is better
Results: 1a = 723.9, 2 = 722.2
SE +/- 0.38, N = 3; SE +/- 0.94, N = 3

ASTC Encoder

Preset: Thorough

ASTC Encoder 2.0: Seconds, fewer is better
Results: 1a = 46.78, 2 = 46.89
SE +/- 0.50, N = 3; SE +/- 0.49, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

FinanceBench

Benchmark: Bonds OpenMP

FinanceBench 2016-07-25: ms, fewer is better
Results: 1a = 71041.97, 2 = 71207.27
SE +/- 217.85, N = 3; SE +/- 331.16, N = 3
1. (CXX) g++ options: -O3 -march=native -fopenmp

NCNN

Target: Vulkan GPU - Model: resnet50

NCNN 20201218: ms, fewer is better
Results: 1a = 43.81, 2 = 43.71
SE +/- 0.07, N = 3; SE +/- 0.05, N = 3
MIN: 41.64 / MAX: 53.86; MIN: 41.74 / MAX: 46.8
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Basis Universal

Settings: ETC1S

Basis Universal 1.12: Seconds, fewer is better
Results: 1a = 59.95, 2 = 60.08
SE +/- 0.33, N = 3; SE +/- 0.31, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

rav1e

Speed: 6

rav1e 0.4: Frames Per Second, more is better
Results: 1a = 1.445, 2 = 1.442, 3 = 1.443
SE +/- 0.002, N = 3; SE +/- 0.005, N = 3; SE +/- 0.001, N = 3

Etcpak

Configuration: ETC1

Etcpak 0.7: Mpx/s, more is better
Results: 1a = 299.46, 2 = 298.89, 3 = 299.22
SE +/- 0.06, N = 3; SE +/- 0.37, N = 3; SE +/- 0.33, N = 3
1. (CXX) g++ options: -O3 -march=native -std=c++11 -lpthread

lzbench

Test: Zstd 1 - Process: Decompression

lzbench 1.8: MB/s, more is better
Results: 1a = 1607, 2 = 1607, 3 = 1610
SE +/- 3.61, N = 3; SE +/- 2.89, N = 3; SE +/- 1.76, N = 3
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

NCNN

Target: Vulkan GPU - Model: mobilenet

NCNN 20201218: ms, fewer is better
Results: 1a = 27.56, 2 = 27.51
SE +/- 0.02, N = 3; SE +/- 0.02, N = 3
MIN: 27.18 / MAX: 28.23; MIN: 27.14 / MAX: 29.4
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

Serpent-XTS 512b Decryption

Cryptsetup: MiB/s, more is better
Results: 1a = 723.8, 2 = 722.5
SE +/- 0.50, N = 3; SE +/- 1.30, N = 2

NCNN

Target: CPU - Model: googlenet

NCNN 20201218: ms, fewer is better
Results: 1a = 22.29, 2 = 22.25
SE +/- 0.27, N = 3; SE +/- 0.35, N = 3
MIN: 21.06 / MAX: 23.6; MIN: 21.23 / MAX: 23.53
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Cryptsetup

Serpent-XTS 256b Encryption

Cryptsetup: MiB/s, more is better
Results: 1a = 736.3, 2 = 735.0
SE +/- 0.66, N = 3; SE +/- 0.32, N = 3

LZ4 Compression

Compression Level: 9 - Decompression Speed

LZ4 Compression 1.9.3: MB/s, more is better
Results: 1a = 5990.0, 2 = 5983.2, 3 = 5980.1
SE +/- 3.49, N = 3; SE +/- 0.86, N = 3; SE +/- 1.13, N = 3
1. (CC) gcc options: -O3

Opus Codec Encoding

WAV To Opus Encode

Opus Codec Encoding 1.3.1: Seconds, fewer is better
Results: 1a = 9.606, 2 = 9.602, 3 = 9.617
SE +/- 0.014, N = 5; SE +/- 0.014, N = 5; SE +/- 0.013, N = 5
1. (CXX) g++ options: -fvisibility=hidden -logg -lm

lzbench

Test: Brotli 2 - Process: Decompression

lzbench 1.8: MB/s, more is better
Results: 1a = 650, 2 = 650, 3 = 651
SE +/- 0.58, N = 3 (reported for 1 of 3 runs)
1. (CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Cryptsetup

Twofish-XTS 256b Decryption

Cryptsetup: MiB/s, more is better
Results: 1a = 404.0, 2 = 403.4
SE +/- 0.15, N = 3; SE +/- 0.34, N = 3

NCNN

Target: CPU - Model: yolov4-tiny

NCNN 20201218: ms, fewer is better
Results: 1a = 40.92, 2 = 40.98
SE +/- 0.02, N = 3; SE +/- 0.05, N = 3
MIN: 40.49 / MAX: 49.83; MIN: 40.54 / MAX: 50.16
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

Test: tConvolve OpenMP - Degridding

ASKAP 1.0: Million Grid Points Per Second, more is better
Results: 1a = 1161.00, 2 = 1159.32
SE +/- 1.69, N = 3; SE +/- 1.69, N = 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4: Total Mop/s, more is better
Results: 1a = 15451.70, 2 = 15459.45, 3 = 15474.02
SE +/- 6.60, N = 3; SE +/- 11.20, N = 3; SE +/- 16.88, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

ASTC Encoder

Preset: Fast

ASTC Encoder 2.0: Seconds, fewer is better
Results: 1a = 7.13, 2 = 7.12
SE +/- 0.03, N = 3; SE +/- 0.00, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

WebP2 Image Encode

Encode Settings: Quality 95, Compression Effort 7

WebP2 Image Encode 20210126: Seconds, fewer is better
Results: 1a = 724.52, 2 = 723.52
SE +/- 0.77, N = 3; SE +/- 1.84, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

PHPBench

PHP Benchmark Suite

PHPBench 0.8.1: Score, more is better
Results: 1a = 655165, 2 = 654272
SE +/- 1381.24, N = 3; SE +/- 1621.04, N = 3

CP2K Molecular Dynamics

Fayalite-FIST Data

CP2K Molecular Dynamics 8.1: Seconds, fewer is better
Results: 1a = 1478.06, 2 = 1478.48, 3 = 1480.07

AI Benchmark Alpha

Device Inference Score

AI Benchmark Alpha 0.1.2: Score, more is better
Results: 1a = 743, 2 = 742

Basis Universal

Settings: UASTC Level 2 + RDO Post-Processing

Basis Universal 1.12: Seconds, fewer is better
Results: 1a = 808.35, 2 = 807.29
SE +/- 0.28, N = 3; SE +/- 0.31, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

NCNN

Target: CPU - Model: vgg16

NCNN 20201218: ms, fewer is better
Results: 1a = 113.97, 2 = 113.82
SE +/- 0.05, N = 3; SE +/- 0.04, N = 3
MIN: 113.51 / MAX: 123.78; MIN: 113.45 / MAX: 122.83
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

IndigoBench

Acceleration: CPU - Scene: Supercar

IndigoBench 4.4: M samples/s, more is better
Results: 1a = 2.509, 2 = 2.512
SE +/- 0.004, N = 3; SE +/- 0.002, N = 3

TNN

Target: CPU - Model: SqueezeNet v1.1

TNN 0.2.3: ms, fewer is better
Results: 1a = 343.57, 2 = 343.21
SE +/- 0.07, N = 3; SE +/- 0.09, N = 3
MIN: 343.23 / MAX: 344.21; MIN: 342.85 / MAX: 343.84
1. (CXX) g++ options: -fopenmp -pthread -fvisibility=hidden -O3 -rdynamic -ldl

ONNX Runtime

Model: shufflenet-v2-10 - Device: OpenMP CPU

ONNX Runtime 1.6: Inferences Per Minute, more is better
Results: 1a = 14156, 2 = 14141
SE +/- 22.47, N = 3; SE +/- 34.58, N = 3
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

dav1d

Video Input: Chimera 1080p 10-bit

dav1d 0.8.1: FPS, more is better
Results: 1a = 88.15, 2 = 88.24, 3 = 88.20
SE +/- 0.04, N = 3; SE +/- 0.08, N = 3; SE +/- 0.11, N = 3
MIN: 57.28 / MAX: 196.21; MIN: 57.38 / MAX: 196.19; MIN: 57.19 / MAX: 197.38
1. (CC) gcc options: -pthread

Cryptsetup

Twofish-XTS 512b Encryption

Cryptsetup: MiB/s, more is better
Results: 1a = 400.9, 2 = 400.5
SE +/- 0.12, N = 3; SE +/- 0.31, N = 3

WebP2 Image Encode

Encode Settings: Quality 100, Compression Effort 5

WebP2 Image Encode 20210126: Seconds, fewer is better
Results: 1a = 21.35, 2 = 21.37
SE +/- 0.21, N = 8; SE +/- 0.23, N = 7
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

NCNN

Target: Vulkan GPU - Model: yolov4-tiny

NCNN 20201218: ms, fewer is better
Results: 1a = 40.95, 2 = 40.99
SE +/- 0.02, N = 3; SE +/- 0.04, N = 3
MIN: 40.51 / MAX: 41.79; MIN: 40.54 / MAX: 49.71
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASKAP

Test: tConvolve MT - Gridding

ASKAP 1.0: Million Grid Points Per Second, more is better
Results: 1a = 613.32, 2 = 613.91
SE +/- 0.23, N = 3; SE +/- 0.37, N = 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

OpenFOAM

Input: Motorbike 30M

OpenFOAM 8: Seconds, fewer is better
Results: 1a = 464.54, 2 = 464.94, 3 = 464.97
SE +/- 0.37, N = 3; SE +/- 0.09, N = 3; SE +/- 0.21, N = 3
1. (CXX) g++ options: -std=c++11 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfoamToVTK -ldynamicMesh -llagrangian -lgenericPatchFields -lfileFormats -lOpenFOAM -ldl -lm

IndigoBench

Acceleration: CPU - Scene: Bedroom

IndigoBench 4.4: M samples/s, more is better
Results: 1a = 1.089, 2 = 1.090
SE +/- 0.000, N = 3; SE +/- 0.001, N = 3

NAS Parallel Benchmarks

Test / Class: FT.C

NAS Parallel Benchmarks 3.4: Total Mop/s, more is better
Results: 1a = 7365.58, 2 = 7371.11, 3 = 7369.78
SE +/- 10.76, N = 3; SE +/- 9.46, N = 3; SE +/- 14.95, N = 3
1. (F9X) gfortran options: -O3 -march=native -pthread -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -ldl -levent -levent_pthreads -lutil -lm -lrt -lz
2. Open MPI 4.0.3

Cryptsetup

Twofish-XTS 256b Encryption

Cryptsetup: MiB/s, more is better
Results: 1a = 400.8, 2 = 400.5
SE +/- 0.32, N = 3; SE +/- 0.38, N = 3

Cryptsetup

Twofish-XTS 512b Decryption

Cryptsetup: MiB/s, more is better
Results: 1a = 403.7, 2 = 403.4
SE +/- 0.15, N = 3; SE +/- 0.40, N = 2

AI Benchmark Alpha

Device AI Score

AI Benchmark Alpha 0.1.2: Score, more is better
Results: 1a = 1430, 2 = 1429

NCNN

Target: CPU - Model: squeezenet_ssd

NCNN 20201218: ms, fewer is better
Results: 1a = 28.66, 2 = 28.68
SE +/- 0.01, N = 3; SE +/- 0.02, N = 3
MIN: 28.25 / MAX: 29.49; MIN: 28.3 / MAX: 29.59
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 2.0: Seconds, fewer is better
Results: 1a = 387.71, 2 = 387.97
SE +/- 0.40, N = 3; SE +/- 0.31, N = 3
1. (CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Basis Universal

Settings: UASTC Level 3

Basis Universal 1.12: Seconds, fewer is better
Results: 1a = 95.51, 2 = 95.57
SE +/- 0.31, N = 3; SE +/- 0.30, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

ASKAP

Test: tConvolve MT - Degridding

ASKAP 1.0: Million Grid Points Per Second, more is better
Results: 1a = 993.03, 2 = 992.41
SE +/- 0.27, N = 3; SE +/- 0.15, N = 3
1. (CXX) g++ options: -O3 -fstrict-aliasing -fopenmp

WebP2 Image Encode

Encode Settings: Quality 100, Lossless Compression

WebP2 Image Encode 20210126: Seconds, fewer is better
Results: 1a = 1445.40, 2 = 1446.29
SE +/- 1.95, N = 3; SE +/- 1.82, N = 3
1. (CXX) g++ options: -msse4.2 -fno-rtti -O3 -rdynamic -lpthread -ljpeg

Timed HMMer Search

Pfam Database Search

Timed HMMer Search 3.3.1: Seconds, fewer is better
Results: 1a = 131.27, 2 = 131.31, 3 = 131.24
SE +/- 0.07, N = 3; SE +/- 0.02, N = 3; SE +/- 0.01, N = 3
1. (CC) gcc options: -O3 -pthread -lhmmer -leasel -lm

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression 1.9.3: MB/s, more is better
Results: 1a = 5983.5, 2 = 5980.3, 3 = 5981.8
SE +/- 1.02, N = 3; SE +/- 1.62, N = 3; SE +/- 0.40, N = 3
1. (CC) gcc options: -O3

Cryptsetup

PBKDF2-sha512

Cryptsetup: Iterations Per Second, more is better
Results: 1a = 1588751, 2 = 1587950
SE +/- 801.33, N = 3 (reported for 1 of 2 runs)

LULESH

LULESH 2.0.3: z/s, more is better
Results: 1a = 2738.35, 2 = 2738.01, 3 = 2739.31
SE +/- 2.52, N = 3; SE +/- 1.49, N = 3; SE +/- 1.38, N = 3
1. (CXX) g++ options: -O3 -fopenmp -lm -pthread -lmpi_cxx -lmpi

NCNN

Target: Vulkan GPU - Model: alexnet

NCNN 20201218: ms, fewer is better
Results: 1a = 21.12, 2 = 21.11
SE +/- 0.01, N = 3; SE +/- 0.01, N = 3
MIN: 20.84 / MAX: 22.35; MIN: 20.86 / MAX: 21.73
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Algebraic Multi-Grid Benchmark

Algebraic Multi-Grid Benchmark 1.2: Figure Of Merit, more is better
Results: 1a = 122372167, 2 = 122317033, 3 = 122334467
SE +/- 10051.26, N = 3; SE +/- 7169.92, N = 3; SE +/- 8824.65, N = 3
1. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -pthread -lmpi

FinanceBench

Benchmark: Repo OpenMP

FinanceBench 2016-07-25: ms, fewer is better
Results: 1a = 49677.14, 2 = 49656.07
SE +/- 562.64, N = 3; SE +/- 734.76, N = 3
1. (CXX) g++ options: -O3 -march=native -fopenmp

Cryptsetup

PBKDF2-whirlpool

Cryptsetup: Iterations Per Second, more is better
Results: 1a = 667331, 2 = 667606
SE +/- 2257.33, N = 3; SE +/- 1574.42, N = 3

NCNN

Target: CPU - Model: mobilenet

NCNN 20201218: ms, fewer is better
Results: 1a = 27.55, 2 = 27.54
SE +/- 0.03, N = 3; SE +/- 0.02, N = 3
MIN: 27.17 / MAX: 28.55; MIN: 27.2 / MAX: 30.25
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: Vulkan GPU - Model: squeezenet_ssd

NCNN 20201218: ms, fewer is better
Results: 1a = 28.69, 2 = 28.70
SE +/- 0.02, N = 3; SE +/- 0.01, N = 3
MIN: 28.24 / MAX: 29.73; MIN: 28.3 / MAX: 29.54
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Redis

Test: LPUSH

Redis 6.0.9: Requests Per Second, more is better
Results: 1a = 1718046.71, 2 = 1717450.17
SE +/- 8997.19, N = 3; SE +/- 3103.90, N = 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Basis Universal

Settings: UASTC Level 0

Basis Universal 1.12: Seconds, fewer is better
Results: 1a = 9.228, 2 = 9.231
SE +/- 0.005, N = 3; SE +/- 0.005, N = 3
1. (CXX) g++ options: -std=c++11 -fvisibility=hidden -fPIC -fno-strict-aliasing -O3 -rdynamic -lm -lpthread

Redis

Test: SET

Redis 6.0.9: Requests Per Second, more is better
Results: 1a = 1980808.83, 2 = 1981355.17
SE +/- 17544.22, N = 3; SE +/- 8366.29, N = 3
1. (CXX) g++ options: -MM -MT -g3 -fvisibility=hidden -O3

Google SynthMark

Test: VoiceMark_100

Google SynthMark 20201109: Voices, more is better
Results: 1a = 615.64, 2 = 615.80
SE +/- 0.20, N = 3; SE +/- 0.18, N = 3
1. (CXX) g++ options: -lm -lpthread -std=c++11 -Ofast

WavPack Audio Encoding

WAV To WavPack

WavPack Audio Encoding 5.3: Seconds, fewer is better
Results: 1a = 16.69, 2 = 16.69
SE +/- 0.01, N = 5; SE +/- 0.01, N = 5
1. (CXX) g++ options: -rdynamic

NCNN

Target: CPU - Model: resnet50

NCNN 20201218: ms, fewer is better
Results: 1a = 43.67, 2 = 43.68
SE +/- 0.02, N = 3; SE +/- 0.04, N = 3
MIN: 41.66 / MAX: 47.83; MIN: 41.61 / MAX: 45.34
1. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

AI Benchmark Alpha

Device Training Score

AI Benchmark Alpha 0.1.2: Score, more is better
Results: 1a = 687, 2 = 687

ONNX Runtime

Model: fcn-resnet101-11 - Device: OpenMP CPU

ONNX Runtime 1.6: Inferences Per Minute, more is better
Results: 1a = 45, 2 = 45
SE +/- 0.00, N = 3; SE +/- 0.17, N = 3
1. (CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: bertsquad-10 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
1a: 397 (SE +/- 1.26, N = 3)
2: 397 (SE +/- 1.15, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ONNX Runtime

Model: yolov4 - Device: OpenMP CPU

ONNX Runtime 1.6 (Inferences Per Minute, More Is Better)
1a: 261 (SE +/- 0.33, N = 3)
2: 261 (SE +/- 0.17, N = 3)
(CXX) g++ options: -fopenmp -ffunction-sections -fdata-sections -O3 -ldl -lrt

ASTC Encoder

Preset: Medium

ASTC Encoder 2.0 (Seconds, Fewer Is Better)
1a: 6.23 (SE +/- 0.00, N = 3)
2: 6.23 (SE +/- 0.00, N = 3)
(CXX) g++ options: -std=c++14 -fvisibility=hidden -O3 -flto -mfpmath=sse -mavx2 -mpopcnt -lpthread

Zstd Compression

Compression Level: 19

Zstd Compression 1.4.5 (MB/s, More Is Better)
1a: 12.5 (SE +/- 0.00, N = 3)
2: 12.5 (SE +/- 0.00, N = 3)
3: 12.5 (SE +/- 0.00, N = 3)
(CC) gcc options: -O3 -pthread -lz -llzma

simdjson

Throughput Test: PartialTweets

simdjson 0.7.1 (GB/s, More Is Better)
1a: 0.6 (SE +/- 0.00, N = 3)
2: 0.6 (SE +/- 0.00, N = 3)
3: 0.6 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: LargeRandom

simdjson 0.7.1 (GB/s, More Is Better)
1a: 0.37 (SE +/- 0.00, N = 3)
2: 0.37 (SE +/- 0.00, N = 3)
3: 0.37 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

simdjson

Throughput Test: Kostya

simdjson 0.7.1 (GB/s, More Is Better)
1a: 0.48 (SE +/- 0.00, N = 3)
2: 0.48 (SE +/- 0.00, N = 3)
3: 0.48 (SE +/- 0.00, N = 3)
(CXX) g++ options: -O3 -pthread

lzbench

Test: Crush 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
1a: 450
2: 450
3: 450
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Decompression

lzbench 1.8 (MB/s, More Is Better)
1a: 105
2: 105
3: 105
SE +/- 0.33, N = 3 (shown for one run only)
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

lzbench

Test: XZ 0 - Process: Compression

lzbench 1.8 (MB/s, More Is Better)
1a: 38
2: 38
3: 38
(CXX) g++ options: -pthread -fomit-frame-pointer -fstrict-aliasing -ffast-math -O3

Warsow

Resolution: 1920 x 1080

Warsow 2.5 Beta (Frames Per Second, More Is Better)
1: 62.9 (SE +/- 0.37, N = 3)

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: Multeasymap

DDraceNetwork 15.2.3 (Frames Per Second, More Is Better)
1: 209.31 (SE +/- 0.86, N = 3; MIN: 54.45 / MAX: 280.9)
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

DDraceNetwork

Resolution: 1920 x 1080 - Mode: Fullscreen - Renderer: OpenGL 3.3 - Zoom: Default - Demo: RaiNyMore2

DDraceNetwork 15.2.3 (Frames Per Second, More Is Better)
1: 92.07 (SE +/- 0.23, N = 3; MIN: 29.47 / MAX: 123)
(CXX) g++ options: -O3 -rdynamic -lcrypto -lz -lrt -lpthread -lcurl -lfreetype -lSDL2 -lwavpack -lopusfile -lopus -logg -lGL -lX11 -lnotify -lgdk_pixbuf-2.0 -lgio-2.0 -lgobject-2.0 -lglib-2.0

NCNN

Target: CPU - Model: regnety_400m

NCNN 20201218 (ms, Fewer Is Better)
1a: 17.88 (SE +/- 0.18, N = 3; MIN: 17.22 / MAX: 18.84)
2: 19.34 (SE +/- 0.77, N = 3; MIN: 17.15 / MAX: 29.77)
(CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 8.29113 (SE +/- 0.06721, N = 15; MIN: 7.1)
2: 7.66942 (SE +/- 0.13886, N = 12; MIN: 5.68)
3: 7.81482 (SE +/- 0.15309, N = 12; MIN: 5.76)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread

oneDNN

Harness: IP Shapes 1D - Data Type: u8s8f32 - Engine: CPU

oneDNN 2.0 (ms, Fewer Is Better)
1a: 2.90509 (SE +/- 0.03413, N = 14; MIN: 2.26)
2: 2.62510 (SE +/- 0.04605, N = 12; MIN: 2.08)
3: 2.65725 (SE +/- 0.04568, N = 12; MIN: 2.1)
(CXX) g++ options: -O3 -std=c++11 -fopenmp -msse4.1 -fPIC -pie -lpthread


Phoronix Test Suite v10.8.4