AMD EPYC 4th Gen AVX-512 Comparison

AMD EPYC 9654 Genoa AVX-512 benchmark comparison by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212195-NE-AVXCOMPAR69
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

C/C++ Compiler Tests 3 Tests
CPU Massive 7 Tests
Creator Workloads 9 Tests
Cryptography 2 Tests
Game Development 2 Tests
HPC - High Performance Computing 16 Tests
Machine Learning 11 Tests
Molecular Dynamics 3 Tests
Multi-Core 10 Tests
NVIDIA GPU Compute 3 Tests
Intel oneAPI 7 Tests
OpenMPI Tests 3 Tests
Python 2 Tests
Raytracing 2 Tests
Renderers 2 Tests
Scientific Computing 3 Tests
Server CPU Tests 6 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs
No Box Plots
On Line Graphs With Missing Data, Connect The Line Gaps

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
Performance Per
Dollar
Date
Run
  Test
  Duration
AVX-512 On
December 18 2022
  20 Hours, 2 Minutes
AVX-512 Off
December 18 2022
  15 Hours, 29 Minutes
Invert Hiding All Results Option
  17 Hours, 45 Minutes
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC 4th Gen AVX-512 ComparisonOpenBenchmarking.orgPhoronix Test Suite2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)AMD Titanite_4G (RTI1002E BIOS)AMD Device 14a41520GB800GB INTEL SSDPF21Q800GBASPEEDVGA HDMIBroadcom NetXtreme BCM5720 PCIeUbuntu 22.106.1.0-phx (x86_64)GNOME Shell 43.0X Server 1.21.1.41.3.224GCC 12.2.0 + Clang 15.0.2-1ext41920x1080ProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen ResolutionAMD EPYC 4th Gen AVX-512 Comparison BenchmarksSystem Logs- Transparent Huge Pages: madvise- AVX-512 On: CXXFLAGS="-O3 -march=native -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mprefer-vector-width=512" CFLAGS="-O3 -march=native -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mprefer-vector-width=512" - AVX-512 Off: CXXFLAGS="-O3 -march=native -mno-avx512f" CFLAGS="-O3 -march=native -mno-avx512f" - --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v - Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10110d - Python 3.10.7- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AVX-512 On vs. AVX-512 Off ComparisonPhoronix Test SuiteBaseline+46.4%+46.4%+92.8%+92.8%+139.2%+139.2%50.5%37.6%3.6%CPU - 16 - AlexNet185.4%R.N.N.T - bf16bf16bf16 - CPU155.1%D.T.S153.4%R.N.N.T - f32 - CPU152.5%W.P.D.F - CPU143.4%W.P.D.F - CPU143%F.D.F - CPU132.2%F.D.F - CPU131.9%A.G.R.R.0.F - CPU120%LBC, LBRY Credits117%Device AI Score115.1%M.T.E.T.D.F - CPU114.8%M.T.E.T.D.F - CPU114.7%W.P.D.F.I - CPU104.7%W.P.D.F.I - CPU104.5%F.D.F.I - CPU104.2%F.D.F.I - CPU103.9%V.D.F - CPU97.1%V.D.F - CPU97%CPU - 16 - GoogLeNet93.3%D.I.S92.8%V.D.F.I - CPU84.7%V.D.F.I - CPU84.6%D.B.s - f32 - CPU78.2%P.V.B.D.F - CPU76.5%P.V.B.D.F - CPU76.4%CPU - 16 - ResNet-5073.7%gravity_spheres_volume/dim_512/scivis/real_time71.4%Q.S.2.P70.9%P.D.F - CPU70%P.D.F - CPU69.5%P.D.F - CPU68.4%P.D.F - CPU67.7%gravity_spheres_volume/dim_512/ao/real_time67.4%scrypt62.9%A.G.R.R.0.F.I - CPU58.3%resnet-v2-5056.1%N.Q.A.B.b.u.S.1.P - A.M.S55.2%N.Q.A.B.b.u.S.1.P - A.M.S55.2%inception-v3Garlicoin46.5%OpenMP - BM144.1%OpenMP - BM144.1%N.Q.A.B.b.u.S.1.P - S.S.S41.5%N.Q.A.B.b.u.S.1.P - S.S.S41.5%C.C.R.5.I - A.M.S38.5%C.C.R.5.I - A.M.S38.4%R.N.N.I - bf16bf16bf16 - CPUA.G.R.R.0.F - CPU37.4%Skeincoin36.7%OpenMP - BM235.4%OpenMP - BM235.4%gravity_spheres_volume/dim_512/pathtracer/real_time33.8%x25x33.1%DistinctUserID30.7%PartialTweets30.4%C.D.Y.C - A.M.S24.1%C.D.Y.C - A.M.S24%Kostya23.3%2 - 1080p - 32 - Path Tracer23%2 - 4K - 16 - Path Tracer22.8%2 - 1080p - 16 - Path Tracer22.7%1 - 1080p - 32 - Path Tracer22.4%1 - 1080p - 1 - Path Tracer22.3%2 - 4K - 32 - Path Tracer22.2%1 - 4K - 1 - Path Tracer22.2%1 - 4K - 32 - Path Tracer22.1%1 - 4K - 16 - Path Tracer22.1%1 - 1080p - 16 - Path Tracer22%3 - 1080p - 16 - Path Tracer21.7%3 - 1080p - 32 - Path Tracer21.6%TopTweet21.2%3 - 4K - 1 - Path Tracer21.2%LargeRand21.2%N.T.C.B.b.u.S - A.M.S21%3 - 4K - 16 - Path Tracer21%N.T.C.B.b.u.S - A.M.S20.9%3 - 1080p - 1 - Path Tracer20.9%3 - 4K - 32 - Path Tracer20.7%Pathtracer ISPC - Asian Dragon20.4%Pathtracer ISPC - Asian Dragon Obj20.2%C.D.Y.C - S.S.S20.1%C.D.Y.C - S.S.S20.1%Pathtracer ISPC - Crown19.8%2 - 4K - 1 - Path Tracer19.5%2 - 1080p - 1 - Path Tracer19.5%N.T.C.D.m - A.M.S19%d.M.M.S - Execution Time19%N.T.C.D.m - A.M.S19%OpenMP - Points2Image18.7%super-resolution-10 - CPU - Standard17.7%N.T.C.B.b.u.S - S.S.S15.6%N.T.C.B.b.u.S - S.S.S15.6%vklBenchmark ISPC15.3%CPU - vision_transformer15%ArcFace ResNet-100 - CPU - Standard14.5%A.G.R.R.0.F.I - CPU12.8%N.T.C.B.b.u.c - S.S.S12.8%N.T.C.B.b.u.c - S.S.S12.8%N.D.C.o.b.u.o.I - S.S.S12.2%N.D.C.o.b.u.o.I - S.S.S12.2%CPU - blazeface12.1%Eigen11.4%CPU - regnety_400m9.2%F.x.A9.1%CPU - efficientnet-b08.9%fcn-resnet101-11 - CPU - Standard8.4%BLAS7.8%Fayalite-FIST6.7%SqueezeNetV1.06.6%d.M.M.S - Mesh Time6.3%N.T.C.D.m - S.S.S5.4%N.T.C.D.m - S.S.S5.4%Preset 12 - Bosphorus 4K5%4.9%JPEG - 904.6%CPU - googlenet4.6%PNG - 904.5%CPU - mnasnet4.5%B.C4.3%C.B.S.A - f32 - CPU4.2%RTLightmap.hdr.4096x4096bertsquad-12 - CPU - Standard3.4%Windowed Gaussian3.3%CPU - FastestDet3%Preset 13 - Bosphorus 4K2.8%JPEG - 1002.8%OpenMP - NDT Mapping2.7%TensorFlowoneDNNAI Benchmark AlphaoneDNNOpenVINOOpenVINOOpenVINOOpenVINOOpenVINOCpuminer-OptAI Benchmark AlphaOpenVINOOpenVINOOpenVINOOpenVINOOpenVINOOpenVINOOpenVINOOpenVINOTensorFlowAI Benchmark AlphaOpenVINOOpenVINOoneDNNOpenVINOOpenVINOTensorFlowOSPRayCpuminer-OptOpenVINOOpenVINOOpenVINOOpenVINOOSPRayCpuminer-OptOpenVINOMobile Neural NetworkNeural Magic DeepSparseNeural Magic DeepSparseMobile Neural NetworkCpuminer-OptminiBUDEminiBUDENeural Magic DeepSparseNeural Magic DeepSparseNeural Magic DeepSparseNeural Magic DeepSparseoneDNNOpenVINOCpuminer-OptminiBUDEminiBUDEOSPRayCpuminer-OptsimdjsonsimdjsonNeural Magic DeepSparseNeural Magic DeepSparsesimdjsonOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudioOSPRay StudiosimdjsonOSPRay StudiosimdjsonNeural Magic DeepSparseOSPRay StudioNeural Magic DeepSparseOSPRay StudioOSPRay StudioEmbreeEmbreeNeural Magic DeepSparseNeural Magic DeepSparseEmbreeOSPRay StudioOSPRay StudioNeural Magic DeepSparseOpenFOAMNeural Magic DeepSparseDarmstadt Automotive Parallel Heterogeneous SuiteONNX RuntimeNeural Magic DeepSparseNeural Magic DeepSparseOpenVKLNCNNONNX RuntimeOpenVINONeural Magic DeepSparseNeural Magic DeepSparseNeural Magic DeepSparseNeural Magic DeepSparseNCNNLeelaChessZeroNCNNSMHasherNCNNONNX RuntimeLeelaChessZeroCP2K Molecular DynamicsMobile Neural NetworkOpenFOAMNeural Magic DeepSparseNeural Magic DeepSparseSVT-AV1Numpy BenchmarkJPEG XL libjxlNCNNJPEG XL libjxlNCNNNumenta Anomaly BenchmarkoneDNNIntel Open Image DenoiseONNX RuntimeNumenta Anomaly BenchmarkNCNNSVT-AV1JPEG XL libjxlDarmstadt Automotive Parallel Heterogeneous SuiteAVX-512 OnAVX-512 Off

AMD EPYC 4th Gen AVX-512 Comparisonminibude: OpenMP - BM1minibude: OpenMP - BM2openvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUembree: Pathtracer ISPC - Asian Dragon Objembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer ISPC - Crownsvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksimdjson: PartialTweetssimdjson: LargeRandsimdjson: Kostyasimdjson: DistinctUserIDsimdjson: TopTweetminibude: OpenMP - BM1minibude: OpenMP - BM2oidn: RT.hdr_alb_nrm.3840x2160oidn: RT.ldr_alb_nrm.3840x2160oidn: RTLightmap.hdr.4096x4096tensorflow: CPU - 16 - ResNet-50tensorflow: CPU - 16 - AlexNettensorflow: CPU - 16 - GoogLeNetonnx: fcn-resnet101-11 - CPU - Standardonnx: super-resolution-10 - CPU - Standardonnx: bertsquad-12 - CPU - Standardonnx: ArcFace ResNet-100 - CPU - Standardopenvkl: vklBenchmark ISPCospray: gravity_spheres_volume/dim_512/ao/real_timeospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/pathtracer/real_timedeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: CV Detection,YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection,YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamcpuminer-opt: scryptcpuminer-opt: Quad SHA-256, Pyritecpuminer-opt: x25xcpuminer-opt: Garlicoincpuminer-opt: Skeincoincpuminer-opt: LBC, LBRY Creditssmhasher: FarmHash32 x86_64 AVXjpegxl: JPEG - 90jpegxl: JPEG - 100jpegxl: PNG - 90lczero: BLASlczero: Eigengromacs: MPI CPU - water_GMX50_bareai-benchmark: Device Inference Scoreai-benchmark: Device Training Scoreai-benchmark: Device AI Scorenumpy: daphne: OpenMP - NDT Mappingdaphne: OpenMP - Points2Imagesmhasher: FarmHash32 x86_64 AVXospray-studio: 1 - 1080p - 1 - Path Tracerospray-studio: 1 - 1080p - 16 - Path Tracerospray-studio: 1 - 1080p - 32 - Path Tracerospray-studio: 1 - 4K - 1 - Path Tracerospray-studio: 1 - 4K - 16 - Path Tracerospray-studio: 1 - 4K - 32 - Path Tracerospray-studio: 2 - 1080p - 1 - Path Tracerospray-studio: 2 - 1080p - 16 - Path Tracerospray-studio: 2 - 1080p - 32 - Path Tracerospray-studio: 2 - 4K - 1 - Path Tracerospray-studio: 2 - 4K - 16 - Path Tracerospray-studio: 2 - 4K - 32 - Path Tracerospray-studio: 3 - 1080p - 1 - Path Tracerospray-studio: 3 - 1080p - 16 - Path Tracerospray-studio: 3 - 1080p - 32 - Path Tracerospray-studio: 3 - 4K - 1 - Path Tracerospray-studio: 3 - 4K - 16 - Path Tracerospray-studio: 3 - 4K - 32 - Path Traceronednn: Convolution Batch Shapes Auto - f32 - CPUonednn: Deconvolution Batch shapes_1d - f32 - CPUonednn: Deconvolution Batch shapes_3d - f32 - CPUonednn: Recurrent Neural Network Training - f32 - CPUonednn: Recurrent Neural Network Training - bf16bf16bf16 - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUmnn: resnet-v2-50mnn: SqueezeNetV1.0mnn: inception-v3ncnn: CPU - mnasnetncnn: CPU - efficientnet-b0ncnn: CPU - blazefacencnn: CPU - googlenetncnn: CPU - resnet50ncnn: CPU - regnety_400mncnn: CPU - vision_transformerncnn: CPU - FastestDetopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUdeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: CV Detection,YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection,YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamnumenta-nab: Bayesian Changepointnumenta-nab: Windowed Gaussiannumenta-nab: Relative Entropyopenfoam: drivaerFastback, Medium Mesh Size - Mesh Timeopenfoam: drivaerFastback, Medium Mesh Size - Execution Timecp2k: Fayalite-FISTAVX-512 OnAVX-512 Off291.982346.081102.04193.93148967.98170652.7143.3443.3419800.209988.4411202.627452.969065.34956.88184.4701212.9364180.9315246.572245.5556.701.264.186.866.527299.5458652.0053.513.511.6622.15157.2960.1727174015161051133244.156643.071154.218784.9988616.6804187.08371195.93671953.025534.5206100.5458761.2241189.5611857.226534.25184800.6122606807696.06715531951330106695039794.449.550.749.959077909618.764351027016211575.661371.1813677.76449484326.38714823534698581927118546154238447646049385188081772804560869411070221840.51616722.58750.9895911962.061955.392361.2715.4418.57945.81242.3057.7125.8872.8066.34247.3274.9358.95469.29246.980.550.361100.251101.009.634.794.286.435.2950.1111.7586155.34625.341380.082649.032828.96099.9408125.78605.2712111.728229.189216.6774.7279.898135.77418113.637331122.725202.604255.66143.9494.97108449.49151239.6025.5025.749672.944110.496065.763782.085135.56445.59153.4476176.8831151.0558239.881233.9445.141.043.395.255.385065.0966391.5193.503.511.7212.7555.1131.122506288499918115526.379425.135440.511773.5380509.4462177.53871005.16151410.805130.606271.0437490.3658157.819690.872230.52672946.0813229605784.2548827142754349174036483.709.130.729.528423816218.467182110662887548.831335.6311521.3826.4121812871574971011321226471842924586172211524229852143413682084113398267850.53768222.83541.763364953.984989.131715.7624.0989.14630.43644.1962.8629.0076.1367.48270.1286.1660.731088.50503.501.210.571864.931845.9519.6911.667.9012.679.33107.6213.5907187.85405.628295.334767.911832.664714.0655195.16566.3313138.586132.750517.3894.88210.035144.3478135.216481198.408OpenBenchmarking.org

CPU Temperature Monitor

OpenBenchmarking.orgCelsiusCPU Temperature MonitorPhoronix Test Suite System MonitoringAVX-512 OnAVX-512 Off1428425670Min: 30.13 / Avg: 49.97 / Max: 73.38Min: 35.5 / Avg: 51.26 / Max: 73.75

CPU Power Consumption Monitor

OpenBenchmarking.orgWattsCPU Power Consumption MonitorPhoronix Test Suite System MonitoringAVX-512 OnAVX-512 Off130260390520650Min: 26.37 / Avg: 434.8 / Max: 766.01Min: 106.95 / Avg: 449.58 / Max: 735.32

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OnAVX-512 Off60120180240300SE +/- 0.23, N = 10SE +/- 1.06, N = 8291.98202.601. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OnAVX-512 Off50100150200250Min: 291.02 / Avg: 291.98 / Max: 292.82Min: 197.67 / Avg: 202.6 / Max: 206.091. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OnAVX-512 Off80160240320400SE +/- 0.46, N = 4SE +/- 1.27, N = 3346.08255.661. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OnAVX-512 Off60120180240300Min: 345.37 / Avg: 346.08 / Max: 347.44Min: 253.22 / Avg: 255.66 / Max: 257.481. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off20406080100SE +/- 0.07, N = 3SE +/- 0.01, N = 3102.0443.94-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off20406080100Min: 101.9 / Avg: 102.04 / Max: 102.12Min: 43.92 / Avg: 43.94 / Max: 43.961. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off4080120160200SE +/- 0.05, N = 3SE +/- 0.02, N = 3193.9394.97-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off4080120160200Min: 193.85 / Avg: 193.93 / Max: 194.02Min: 94.94 / Avg: 94.97 / Max: 95.011. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OnAVX-512 Off30K60K90K120K150KSE +/- 1328.20, N = 3SE +/- 1130.85, N = 4148967.98108449.49-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OnAVX-512 Off30K60K90K120K150KMin: 147146.13 / Avg: 148967.98 / Max: 151553.12Min: 105463.52 / Avg: 108449.49 / Max: 110959.331. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off40K80K120K160K200KSE +/- 127.59, N = 3SE +/- 1455.37, N = 3170652.71151239.60-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off30K60K90K120K150KMin: 170446.67 / Avg: 170652.71 / Max: 170886.12Min: 149021.41 / Avg: 151239.6 / Max: 153980.871. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off1020304050SE +/- 0.20, N = 3SE +/- 0.22, N = 343.3425.50-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off918273645Min: 42.98 / Avg: 43.34 / Max: 43.69Min: 25.1 / Avg: 25.5 / Max: 25.871. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OnAVX-512 Off1020304050SE +/- 0.18, N = 3SE +/- 0.04, N = 343.3425.74-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OnAVX-512 Off918273645Min: 43.05 / Avg: 43.34 / Max: 43.68Min: 25.67 / Avg: 25.74 / Max: 25.821. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off4K8K12K16K20KSE +/- 12.72, N = 3SE +/- 1.81, N = 319800.209672.94-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off3K6K9K12K15KMin: 19774.94 / Avg: 19800.2 / Max: 19815.45Min: 9669.49 / Avg: 9672.94 / Max: 9675.621. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 6.96, N = 3SE +/- 4.29, N = 39988.444110.49-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KMin: 9979.24 / Avg: 9988.44 / Max: 10002.08Min: 4102.12 / Avg: 4110.49 / Max: 4116.281. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 2.53, N = 3SE +/- 1.39, N = 311202.626065.76-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KMin: 11199.3 / Avg: 11202.62 / Max: 11207.6Min: 6063.35 / Avg: 6065.76 / Max: 6068.181. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off16003200480064008000SE +/- 3.52, N = 3SE +/- 10.17, N = 37452.963782.08-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off13002600390052006500Min: 7447.61 / Avg: 7452.96 / Max: 7459.59Min: 3761.74 / Avg: 3782.08 / Max: 3792.331. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 4.01, N = 3SE +/- 2.17, N = 39065.345135.56-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off16003200480064008000Min: 9058.88 / Avg: 9065.34 / Max: 9072.69Min: 5131.78 / Avg: 5135.56 / Max: 5139.31. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OnAVX-512 Off2004006008001000SE +/- 4.00, N = 3SE +/- 3.04, N = 3956.88445.59-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OnAVX-512 Off2004006008001000Min: 948.96 / Avg: 956.88 / Max: 961.84Min: 439.5 / Avg: 445.59 / Max: 448.691. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian Dragon ObjAVX-512 OnAVX-512 Off4080120160200SE +/- 0.62, N = 4SE +/- 0.36, N = 4184.47153.45MIN: 178.83 / MAX: 196.34MIN: 130.31 / MAX: 166.17
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian Dragon ObjAVX-512 OnAVX-512 Off306090120150Min: 183.33 / Avg: 184.47 / Max: 185.82Min: 152.46 / Avg: 153.45 / Max: 154.16

OpenBenchmarking.orgFrames Per Second Per Watt, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian DragonAVX-512 OnAVX-512 Off0.15910.31820.47730.63640.79550.7070.431

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian DragonAVX-512 OnAVX-512 Off50100150200250SE +/- 0.22, N = 9SE +/- 0.21, N = 8212.94176.88MIN: 207.72 / MAX: 227.11MIN: 170.19 / MAX: 190
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian DragonAVX-512 OnAVX-512 Off4080120160200Min: 211.85 / Avg: 212.94 / Max: 214.13Min: 176.2 / Avg: 176.88 / Max: 178.02

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: CrownAVX-512 OnAVX-512 Off4080120160200SE +/- 0.62, N = 8SE +/- 0.25, N = 7180.93151.06MIN: 124.41 / MAX: 209.74MIN: 114.4 / MAX: 176.43
OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: CrownAVX-512 OnAVX-512 Off306090120150Min: 178.03 / Avg: 180.93 / Max: 183.42Min: 150.19 / Avg: 151.06 / Max: 151.76

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.4Encoder Mode: Preset 13 - Input: Bosphorus 4KAVX-512 OnAVX-512 Off50100150200250SE +/- 4.74, N = 15SE +/- 4.01, N = 15246.57239.88
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.4Encoder Mode: Preset 13 - Input: Bosphorus 4KAVX-512 OnAVX-512 Off4080120160200Min: 205.7 / Avg: 246.57 / Max: 274.37Min: 212.49 / Avg: 239.88 / Max: 272.93

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.4Encoder Mode: Preset 12 - Input: Bosphorus 4KAVX-512 OnAVX-512 Off50100150200250SE +/- 4.78, N = 15SE +/- 3.57, N = 15245.56233.94
OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.4Encoder Mode: Preset 12 - Input: Bosphorus 4KAVX-512 OnAVX-512 Off4080120160200Min: 209.63 / Avg: 245.56 / Max: 275.28Min: 196.34 / Avg: 233.94 / Max: 253.62

simdjson

OpenBenchmarking.orgGB/s Per Watt, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsAVX-512 OnAVX-512 Off0.00720.01440.02160.02880.0360.0320.023

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsAVX-512 OnAVX-512 Off246810SE +/- 0.04, N = 3SE +/- 0.01, N = 36.705.14-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweetsAVX-512 OnAVX-512 Off3691215Min: 6.63 / Avg: 6.7 / Max: 6.77Min: 5.12 / Avg: 5.14 / Max: 5.151. (CXX) g++ options: -O3 -march=native

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandomAVX-512 OnAVX-512 Off0.28350.5670.85051.1341.4175SE +/- 0.00, N = 3SE +/- 0.00, N = 31.261.04-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandomAVX-512 OnAVX-512 Off246810Min: 1.26 / Avg: 1.26 / Max: 1.26Min: 1.04 / Avg: 1.04 / Max: 1.041. (CXX) g++ options: -O3 -march=native

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: KostyaAVX-512 OnAVX-512 Off0.94051.8812.82153.7624.7025SE +/- 0.02, N = 3SE +/- 0.00, N = 34.183.39-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: KostyaAVX-512 OnAVX-512 Off246810Min: 4.16 / Avg: 4.18 / Max: 4.21Min: 3.39 / Avg: 3.39 / Max: 3.391. (CXX) g++ options: -O3 -march=native

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserIDAVX-512 OnAVX-512 Off246810SE +/- 0.02, N = 3SE +/- 0.01, N = 36.865.25-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserIDAVX-512 OnAVX-512 Off3691215Min: 6.82 / Avg: 6.86 / Max: 6.89Min: 5.24 / Avg: 5.25 / Max: 5.261. (CXX) g++ options: -O3 -march=native

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweetAVX-512 OnAVX-512 Off246810SE +/- 0.08, N = 4SE +/- 0.01, N = 36.525.38-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweetAVX-512 OnAVX-512 Off3691215Min: 6.3 / Avg: 6.52 / Max: 6.67Min: 5.36 / Avg: 5.38 / Max: 5.391. (CXX) g++ options: -O3 -march=native

miniBUDE

MiniBUDE is a mini application for the the core computation of the Bristol University Docking Engine (BUDE). This test profile currently makes use of the OpenMP implementation of miniBUDE for CPU benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OnAVX-512 Off16003200480064008000SE +/- 5.79, N = 10SE +/- 26.42, N = 87299.555065.101. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM1AVX-512 OnAVX-512 Off13002600390052006500Min: 7275.51 / Avg: 7299.54 / Max: 7320.47Min: 4941.74 / Avg: 5065.1 / Max: 5152.21. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 11.62, N = 4SE +/- 31.65, N = 38652.016391.521. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm
OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM2AVX-512 OnAVX-512 Off15003000450060007500Min: 8634.19 / Avg: 8652.01 / Max: 8686.09Min: 6330.61 / Avg: 6391.52 / Max: 6436.881. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x2160AVX-512 OnAVX-512 Off0.78981.57962.36943.15923.949SE +/- 0.01, N = 5SE +/- 0.03, N = 53.513.50
OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x2160AVX-512 OnAVX-512 Off246810Min: 3.47 / Avg: 3.51 / Max: 3.55Min: 3.42 / Avg: 3.5 / Max: 3.55

OpenBenchmarking.orgImages / Sec Per Watt, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.ldr_alb_nrm.3840x2160AVX-512 OnAVX-512 Off0.00270.00540.00810.01080.01350.0120.010

OpenBenchmarking.orgImages / Sec Per Watt, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x4096AVX-512 OnAVX-512 Off0.00110.00220.00330.00440.00550.0050.004

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.ldr_alb_nrm.3840x2160AVX-512 OnAVX-512 Off0.78981.57962.36943.15923.949SE +/- 0.01, N = 5SE +/- 0.01, N = 53.513.51
OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.ldr_alb_nrm.3840x2160AVX-512 OnAVX-512 Off246810Min: 3.49 / Avg: 3.51 / Max: 3.53Min: 3.47 / Avg: 3.51 / Max: 3.54

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x4096AVX-512 OnAVX-512 Off0.3870.7741.1611.5481.935SE +/- 0.01, N = 3SE +/- 0.01, N = 31.661.72
OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x4096AVX-512 OnAVX-512 Off246810Min: 1.65 / Avg: 1.66 / Max: 1.67Min: 1.7 / Avg: 1.72 / Max: 1.73

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). Note with the Phoronix Test Suite there is also pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries too. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 16 - Model: ResNet-50AVX-512 OnAVX-512 Off510152025SE +/- 0.11, N = 3SE +/- 0.01, N = 322.1512.75
OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 16 - Model: ResNet-50AVX-512 OnAVX-512 Off510152025Min: 21.93 / Avg: 22.15 / Max: 22.31Min: 12.72 / Avg: 12.75 / Max: 12.77

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 16 - Model: AlexNetAVX-512 OnAVX-512 Off306090120150SE +/- 2.35, N = 12SE +/- 0.19, N = 3157.2955.11
OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 16 - Model: AlexNetAVX-512 OnAVX-512 Off306090120150Min: 140.38 / Avg: 157.29 / Max: 168.09Min: 54.87 / Avg: 55.11 / Max: 55.49

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 16 - Model: GoogLeNetAVX-512 OnAVX-512 Off1326395265SE +/- 1.01, N = 15SE +/- 0.25, N = 360.1731.12
OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 16 - Model: GoogLeNetAVX-512 OnAVX-512 Off1224364860Min: 54.32 / Avg: 60.17 / Max: 63.8Min: 30.8 / Avg: 31.12 / Max: 31.61

ONNX Runtime

OpenBenchmarking.orgInferences Per Minute Per Watt, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off0.12330.24660.36990.49320.61650.5480.497

OpenBenchmarking.orgInferences Per Minute Per Watt, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off71421283528.5423.57

OpenBenchmarking.orgInferences Per Minute Per Watt, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off0.53531.07061.60592.14122.67652.3792.051

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off60120180240300SE +/- 0.44, N = 3SE +/- 2.36, N = 12271250-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off50100150200250Min: 270.5 / Avg: 271.33 / Max: 272Min: 236 / Avg: 250.13 / Max: 263.51. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off16003200480064008000SE +/- 5.46, N = 3SE +/- 260.54, N = 1274016288-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: super-resolution-10 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off13002600390052006500Min: 7390.5 / Avg: 7401.33 / Max: 7408Min: 5469 / Avg: 6287.88 / Max: 74121. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenBenchmarking.orgInferences Per Minute Per Watt, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off0.31340.62680.94021.25361.5671.3931.315

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off110220330440550SE +/- 1.86, N = 3SE +/- 3.11, N = 3516499-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: bertsquad-12 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off90180270360450Min: 512.5 / Avg: 516.17 / Max: 518.5Min: 495.5 / Avg: 499.33 / Max: 505.51. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off2004006008001000SE +/- 39.92, N = 12SE +/- 20.83, N = 121051918-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt
OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: ArcFace ResNet-100 - Device: CPU - Executor: StandardAVX-512 OnAVX-512 Off2004006008001000Min: 888 / Avg: 1051.21 / Max: 1294.5Min: 825.5 / Avg: 918.25 / Max: 1020.51. (CXX) g++ options: -O3 -march=native -ffunction-sections -fdata-sections -mtune=native -flto=auto -fno-fat-lto-objects -ldl -lrt

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library that offers high-performance volume computation kernels and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPCAVX-512 OnAVX-512 Off30060090012001500SE +/- 13.58, N = 3SE +/- 8.11, N = 313321155MIN: 329 / MAX: 4770MIN: 251 / MAX: 5181
OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPCAVX-512 OnAVX-512 Off2004006008001000Min: 1308 / Avg: 1332 / Max: 1355Min: 1140 / Avg: 1154.67 / Max: 1168

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/ao/real_timeAVX-512 OnAVX-512 Off1020304050SE +/- 0.21, N = 3SE +/- 0.02, N = 344.1626.38
OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/ao/real_timeAVX-512 OnAVX-512 Off918273645Min: 43.75 / Avg: 44.16 / Max: 44.43Min: 26.34 / Avg: 26.38 / Max: 26.42

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeAVX-512 OnAVX-512 Off1020304050SE +/- 0.10, N = 3SE +/- 0.04, N = 343.0725.14
OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/scivis/real_timeAVX-512 OnAVX-512 Off918273645Min: 42.93 / Avg: 43.07 / Max: 43.27Min: 25.05 / Avg: 25.14 / Max: 25.19

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timeAVX-512 OnAVX-512 Off1224364860SE +/- 0.05, N = 3SE +/- 0.03, N = 354.2240.51
OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timeAVX-512 OnAVX-512 Off1122334455Min: 54.14 / Avg: 54.22 / Max: 54.3Min: 40.45 / Avg: 40.51 / Max: 40.56

Neural Magic DeepSparse

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off20406080100SE +/- 0.45, N = 3SE +/- 0.27, N = 385.0073.54
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off1632486480Min: 84.41 / Avg: 85 / Max: 85.88Min: 73.03 / Avg: 73.54 / Max: 73.92

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off130260390520650SE +/- 1.92, N = 3SE +/- 0.33, N = 3616.68509.45
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off110220330440550Min: 613.23 / Avg: 616.68 / Max: 619.87Min: 508.96 / Avg: 509.45 / Max: 510.07

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off4080120160200SE +/- 0.11, N = 3SE +/- 0.29, N = 3187.08177.54
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off306090120150Min: 186.88 / Avg: 187.08 / Max: 187.25Min: 177.08 / Avg: 177.54 / Max: 178.07

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off30060090012001500SE +/- 2.39, N = 3SE +/- 0.78, N = 31195.941005.16
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off2004006008001000Min: 1191.45 / Avg: 1195.94 / Max: 1199.62Min: 1003.93 / Avg: 1005.16 / Max: 1006.6

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off400800120016002000SE +/- 4.32, N = 3SE +/- 0.17, N = 31953.031410.81
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off30060090012001500Min: 1946.33 / Avg: 1953.03 / Max: 1961.1Min: 1410.51 / Avg: 1410.81 / Max: 1411.1

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off816243240SE +/- 0.03, N = 3SE +/- 0.03, N = 334.5230.61
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off714212835Min: 34.45 / Avg: 34.52 / Max: 34.56Min: 30.54 / Avg: 30.61 / Max: 30.66

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off20406080100SE +/- 0.17, N = 3SE +/- 0.17, N = 3100.5571.04
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off20406080100Min: 100.37 / Avg: 100.55 / Max: 100.89Min: 70.71 / Avg: 71.04 / Max: 71.24

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off160320480640800SE +/- 0.79, N = 3SE +/- 0.61, N = 3761.22490.37
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off130260390520650Min: 760.22 / Avg: 761.22 / Max: 762.79Min: 489.16 / Avg: 490.37 / Max: 491.14

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off4080120160200SE +/- 0.22, N = 3SE +/- 0.17, N = 3189.56157.82
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off306090120150Min: 189.14 / Avg: 189.56 / Max: 189.86Min: 157.49 / Avg: 157.82 / Max: 158.08

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off2004006008001000SE +/- 0.77, N = 3SE +/- 1.12, N = 3857.23690.87
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off150300450600750Min: 855.84 / Avg: 857.23 / Max: 858.52Min: 688.64 / Avg: 690.87 / Max: 692.09

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off816243240SE +/- 0.08, N = 3SE +/- 0.10, N = 334.2530.53
OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off714212835Min: 34.13 / Avg: 34.25 / Max: 34.41Min: 30.33 / Avg: 30.53 / Max: 30.67

Cpuminer-Opt

Cpuminer-Opt is a fork of cpuminer-multi that carries a wide range of CPU performance optimizations for measuring the potential cryptocurrency mining performance of the CPU/processor with a wide variety of cryptocurrencies. The benchmark reports the hash speed for the CPU mining performance for the selected cryptocurrency. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: scryptAVX-512 OnAVX-512 Off10002000300040005000SE +/- 0.45, N = 3SE +/- 0.31, N = 34800.612946.08-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: scryptAVX-512 OnAVX-512 Off8001600240032004000Min: 4799.75 / Avg: 4800.61 / Max: 4801.27Min: 2945.47 / Avg: 2946.08 / Max: 2946.491. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: Quad SHA-256, PyriteAVX-512 OnAVX-512 Off500K1000K1500K2000K2500KSE +/- 9040.25, N = 3SE +/- 276.83, N = 322606801322960-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: Quad SHA-256, PyriteAVX-512 OnAVX-512 Off400K800K1200K1600K2000KMin: 2248170 / Avg: 2260680 / Max: 2278240Min: 1322630 / Avg: 1322960 / Max: 13235101. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: x25xAVX-512 OnAVX-512 Off16003200480064008000SE +/- 16.58, N = 3SE +/- 14.41, N = 37696.065784.25-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: x25xAVX-512 OnAVX-512 Off13002600390052006500Min: 7667.62 / Avg: 7696.06 / Max: 7725.06Min: 5764.56 / Avg: 5784.25 / Max: 5812.331. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: GarlicoinAVX-512 OnAVX-512 Off15K30K45K60K75KSE +/- 89.69, N = 3SE +/- 13.33, N = 37155348827-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: GarlicoinAVX-512 OnAVX-512 Off12K24K36K48K60KMin: 71380 / Avg: 71553.33 / Max: 71680Min: 48800 / Avg: 48826.67 / Max: 488401. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenBenchmarking.orgkH/s Per Watt, More Is BetterCpuminer-Opt 3.20.3Algorithm: SkeincoinAVX-512 OnAVX-512 Off70014002100280035003137.562381.94

OpenBenchmarking.orgkH/s Per Watt, More Is BetterCpuminer-Opt 3.20.3Algorithm: LBC, LBRY CreditsAVX-512 OnAVX-512 Off4008001200160020001658.51802.76

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: SkeincoinAVX-512 OnAVX-512 Off400K800K1200K1600K2000KSE +/- 13005.55, N = 3SE +/- 3964.01, N = 319513301427543-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: SkeincoinAVX-512 OnAVX-512 Off300K600K900K1200K1500KMin: 1928260 / Avg: 1951330 / Max: 1973270Min: 1421210 / Avg: 1427543.33 / Max: 14348401. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp

OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: LBC, LBRY CreditsAVX-512 OnAVX-512 Off200K400K600K800K1000KSE +/- 7645.47, N = 3SE +/- 667.26, N = 31066950491740-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp
OpenBenchmarking.orgkH/s, More Is BetterCpuminer-Opt 3.20.3Algorithm: LBC, LBRY CreditsAVX-512 OnAVX-512 Off200K400K600K800K1000KMin: 1058250 / Avg: 1066950 / Max: 1082190Min: 490410 / Avg: 491740 / Max: 4925001. (CXX) g++ options: -O3 -march=native -lcurl -lz -lpthread -lssl -lcrypto -lgmp

AI Benchmark Alpha

MinAvgMaxAVX-512 On240036554107AVX-512 Off306837104285OpenBenchmarking.orgMegahertz, More Is BetterAI Benchmark Alpha 0.1.2CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

Neural Magic DeepSparse

MinAvgMaxAVX-512 On240035593801AVX-512 Off350736983967OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240031093928AVX-512 Off258931354299OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240035413702AVX-512 Off369237124154OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240028683737AVX-512 Off262329874137OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240030033730AVX-512 Off255229383959OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240035513868AVX-512 Off368637114893OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor12002400360048006000

MinAvgMaxAVX-512 On240035713764AVX-512 Off367837024150OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029453885AVX-512 Off256832144450OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor12002400360048006000

MinAvgMaxAVX-512 On240035273821AVX-512 Off306836414426OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240028793730AVX-512 Off253529083834OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240035333703AVX-512 Off308936794336OpenBenchmarking.orgMegahertz, More Is BetterNeural Magic DeepSparse 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

LeelaChessZero

OpenBenchmarking.orgMegahertz, More Is BetterLeelaChessZero 0.28CPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off9001800270036004500Min: 2400 / Avg: 3683.95 / Max: 3952Min: 3383 / Avg: 3700.34 / Max: 4961

MinAvgMaxAVX-512 On240036854074AVX-512 Off369436994052OpenBenchmarking.orgMegahertz, More Is BetterLeelaChessZero 0.28CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

OSPRay Studio

MinAvgMaxAVX-512 On240029393695AVX-512 Off270829973911OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240029303699AVX-512 Off269629794041OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029043702AVX-512 Off280229784050OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029503701AVX-512 Off269630013902OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240028883699AVX-512 Off270929373983OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240028763752AVX-512 Off276929893982OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240028593697AVX-512 Off270229953808OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240029223700AVX-512 Off278329933739OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240029353696AVX-512 Off277129723740OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240028683857AVX-512 Off271230043940OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240029063696AVX-512 Off272129603862OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240029073700AVX-512 Off272630074349OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029313701AVX-512 Off270530013951OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240029263699AVX-512 Off272129714031OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029233697AVX-512 Off271529724286OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029373698AVX-512 Off272129964398OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029103697AVX-512 Off275229564184OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029433701AVX-512 Off272729733711OpenBenchmarking.orgMegahertz, More Is BetterOSPRay Studio 0.11CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

oneDNN

MinAvgMaxAVX-512 On240031823695AVX-512 Off286033483869OpenBenchmarking.orgMegahertz, More Is BetteroneDNN 2.7CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240033943729AVX-512 Off293335263802OpenBenchmarking.orgMegahertz, More Is BetteroneDNN 2.7CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240030083695AVX-512 Off356936664064OpenBenchmarking.orgMegahertz, More Is BetteroneDNN 2.7CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240035713711AVX-512 Off358136233818OpenBenchmarking.orgMegahertz, More Is BetteroneDNN 2.7CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

OpenBenchmarking.orgMegahertz, More Is BetteroneDNN 2.7CPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off7001400210028003500Min: 2400 / Avg: 3567.66 / Max: 3704Min: 3582 / Avg: 3626.86 / Max: 4223

MinAvgMaxAVX-512 On240035073700AVX-512 Off357036124005OpenBenchmarking.orgMegahertz, More Is BetteroneDNN 2.7CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

Mobile Neural Network

OpenBenchmarking.orgMegahertz, More Is BetterMobile Neural Network 2.1CPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off7001400210028003500Min: 2400 / Avg: 3679.28 / Max: 3704Min: 3694 / Avg: 3699.72 / Max: 4041

NCNN

OpenBenchmarking.orgMegahertz, More Is BetterNCNN 20220729CPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off7001400210028003500Min: 2400 / Avg: 3565.67 / Max: 3699Min: 3563 / Avg: 3574.38 / Max: 3815

OpenVINO

MinAvgMaxAVX-512 On240031263696AVX-512 Off265229064086OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240031293696AVX-512 Off292030813713OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240036053828AVX-512 Off299231934056OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240036303754AVX-512 Off347636743707OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240034803726AVX-512 Off270931503863OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240034663698AVX-512 Off272931003993OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240030373696AVX-512 Off293030703969OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240030603695AVX-512 Off267228483946OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240030363900AVX-512 Off298830684270OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240029723706AVX-512 Off257127304230OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240030993699AVX-512 Off279929423941OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240032563707AVX-512 Off259229663926OpenBenchmarking.orgMegahertz, More Is BetterOpenVINO 2022.2.devCPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

GROMACS

MinAvgMaxAVX-512 On240031683725AVX-512 Off287533584145OpenBenchmarking.orgMegahertz, More Is BetterGROMACS 2022.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

Numpy Benchmark

MinAvgMaxAVX-512 On240036733705AVX-512 Off369437024024OpenBenchmarking.orgMegahertz, More Is BetterNumpy BenchmarkCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

Numenta Anomaly Benchmark

MinAvgMaxAVX-512 On240035323699AVX-512 Off369437074079OpenBenchmarking.orgMegahertz, More Is BetterNumenta Anomaly Benchmark 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

MinAvgMaxAVX-512 On240032373715AVX-512 Off369437183948OpenBenchmarking.orgMegahertz, More Is BetterNumenta Anomaly Benchmark 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

MinAvgMaxAVX-512 On240034373911AVX-512 Off369437023768OpenBenchmarking.orgMegahertz, More Is BetterNumenta Anomaly Benchmark 1.1CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

Darmstadt Automotive Parallel Heterogeneous Suite

MinAvgMaxAVX-512 On240035163702AVX-512 Off369437194426OpenBenchmarking.orgMegahertz, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteCPU Peak Freq (Highest CPU Core Frequency) Monitor11002200330044005500

OpenBenchmarking.orgMegahertz, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteCPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off8001600240032004000Min: 2400 / Avg: 3647.5 / Max: 3715Min: 3694 / Avg: 3698.64 / Max: 4768

OpenFOAM

MinAvgMaxAVX-512 On240031863701AVX-512 Off291732653714OpenBenchmarking.orgMegahertz, More Is BetterOpenFOAM 10CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

CP2K Molecular Dynamics

MinAvgMaxAVX-512 On240036613700AVX-512 Off356936653808OpenBenchmarking.orgMegahertz, More Is BetterCP2K Molecular Dynamics 8.2CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

JPEG XL libjxl

MinAvgMaxAVX-512 On240036573712AVX-512 Off366937013814OpenBenchmarking.orgMegahertz, More Is BetterJPEG XL libjxl 0.7CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

OpenBenchmarking.orgMegahertz, More Is BetterJPEG XL libjxl 0.7CPU Peak Freq (Highest CPU Core Frequency) MonitorAVX-512 OnAVX-512 Off8001600240032004000Min: 2400 / Avg: 3682.16 / Max: 4023Min: 3414 / Avg: 3701.38 / Max: 4381

MinAvgMaxAVX-512 On240036433712AVX-512 Off243936963828OpenBenchmarking.orgMegahertz, More Is BetterJPEG XL libjxl 0.7CPU Peak Freq (Highest CPU Core Frequency) Monitor10002000300040005000

SMHasher

SMHasher is a hash function tester supporting various algorithms and able to make use of AVX and other modern CPU instruction set extensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: FarmHash32 x86_64 AVXAVX-512 OnAVX-512 Off9K18K27K36K45KSE +/- 0.46, N = 5SE +/- 28.78, N = 539794.4436483.70-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -flto=auto -fno-fat-lto-objects
OpenBenchmarking.orgMiB/sec, More Is BetterSMHasher 2022-08-22Hash: FarmHash32 x86_64 AVXAVX-512 OnAVX-512 Off7K14K21K28K35KMin: 39792.79 / Avg: 39794.44 / Max: 39795.48Min: 36395.35 / Avg: 36483.7 / Max: 36557.871. (CXX) g++ options: -O3 -march=native -flto=auto -fno-fat-lto-objects

JPEG XL libjxl

The JPEG XL Image Coding System is designed to provide next-generation JPEG image capabilities with JPEG XL offering better image quality and compression over legacy JPEG. This test profile is currently focused on the multi-threaded JPEG XL image encode performance using the reference libjxl library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: JPEG - Quality: 90AVX-512 OnAVX-512 Off3691215SE +/- 0.04, N = 3SE +/- 0.03, N = 39.559.13-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -fno-rtti -funwind-tables -O2 -fPIE -pie -lm -latomic
OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: JPEG - Quality: 90AVX-512 OnAVX-512 Off3691215Min: 9.48 / Avg: 9.55 / Max: 9.59Min: 9.07 / Avg: 9.13 / Max: 9.181. (CXX) g++ options: -O3 -march=native -fno-rtti -funwind-tables -O2 -fPIE -pie -lm -latomic

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: JPEG - Quality: 100AVX-512 OnAVX-512 Off0.16650.3330.49950.6660.8325SE +/- 0.01, N = 9SE +/- 0.01, N = 90.740.72-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -fno-rtti -funwind-tables -O2 -fPIE -pie -lm -latomic
OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: JPEG - Quality: 100AVX-512 OnAVX-512 Off246810Min: 0.7 / Avg: 0.74 / Max: 0.77Min: 0.66 / Avg: 0.72 / Max: 0.771. (CXX) g++ options: -O3 -march=native -fno-rtti -funwind-tables -O2 -fPIE -pie -lm -latomic

OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: PNG - Quality: 90AVX-512 OnAVX-512 Off3691215SE +/- 0.02, N = 3SE +/- 0.02, N = 39.959.52-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -fno-rtti -funwind-tables -O2 -fPIE -pie -lm -latomic
OpenBenchmarking.orgMP/s, More Is BetterJPEG XL libjxl 0.7Input: PNG - Quality: 90AVX-512 OnAVX-512 Off3691215Min: 9.91 / Avg: 9.95 / Max: 9.99Min: 9.48 / Avg: 9.52 / Max: 9.561. (CXX) g++ options: -O3 -march=native -fno-rtti -funwind-tables -O2 -fPIE -pie -lm -latomic

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated vian neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 103.04, N = 4SE +/- 21.53, N = 390778423-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -flto -O3 -march=native -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: BLASAVX-512 OnAVX-512 Off16003200480064008000Min: 8776 / Avg: 9076.5 / Max: 9244Min: 8380 / Avg: 8422.67 / Max: 84491. (CXX) g++ options: -flto -O3 -march=native -pthread

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 45.37, N = 3SE +/- 43.59, N = 390968162-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -flto -O3 -march=native -pthread
OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.28Backend: EigenAVX-512 OnAVX-512 Off16003200480064008000Min: 9020 / Avg: 9096.33 / Max: 9177Min: 8076 / Avg: 8162.33 / Max: 82161. (CXX) g++ options: -flto -O3 -march=native -pthread

GROMACS

The GROMACS (GROningen MAchine for Chemical Simulations) molecular dynamics package testing with the water_GMX50 data. This test profile allows selecting between CPU and GPU-based GROMACS builds. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bareAVX-512 OnAVX-512 Off510152025SE +/- 0.24, N = 3SE +/- 0.10, N = 318.7618.47-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native
OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bareAVX-512 OnAVX-512 Off510152025Min: 18.29 / Avg: 18.76 / Max: 19Min: 18.32 / Avg: 18.47 / Max: 18.661. (CXX) g++ options: -O3 -march=native

AI Benchmark Alpha

OpenBenchmarking.orgScore Per Watt, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreAVX-512 OnAVX-512 Off51015202520.5229.349

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Inference ScoreAVX-512 OnAVX-512 Off800160024003200400035101821

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device Training ScoreAVX-512 OnAVX-512 Off600120018002400300027011066

Numpy Benchmark

OpenBenchmarking.orgScore Per Watt, More Is BetterNumpy BenchmarkAVX-512 OnAVX-512 Off0.60461.20921.81382.41843.0232.6872.474

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterAI Benchmark Alpha 0.1.2Device AI ScoreAVX-512 OnAVX-512 Off1300260039005200650062112887

Numpy Benchmark

This is a test to obtain the general Numpy performance. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkAVX-512 OnAVX-512 Off120240360480600SE +/- 2.18, N = 3SE +/- 0.59, N = 3575.66548.83
OpenBenchmarking.orgScore, More Is BetterNumpy BenchmarkAVX-512 OnAVX-512 Off100200300400500Min: 571.31 / Avg: 575.66 / Max: 578.06Min: 547.65 / Avg: 548.83 / Max: 549.45

Darmstadt Automotive Parallel Heterogeneous Suite

DAPHNE is the Darmstadt Automotive Parallel HeterogeNEous Benchmark Suite with OpenCL / CUDA / OpenMP test cases for these automotive benchmarks for evaluating programming models in context to vehicle autonomous driving capabilities. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT MappingAVX-512 OnAVX-512 Off30060090012001500SE +/- 5.94, N = 3SE +/- 8.66, N = 31371.181335.631. (CXX) g++ options: -O3 -std=c++11 -fopenmp
OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: NDT MappingAVX-512 OnAVX-512 Off2004006008001000Min: 1361.03 / Avg: 1371.18 / Max: 1381.61Min: 1319.94 / Avg: 1335.63 / Max: 1349.821. (CXX) g++ options: -O3 -std=c++11 -fopenmp

OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2ImageAVX-512 OnAVX-512 Off3K6K9K12K15KSE +/- 126.17, N = 15SE +/- 269.82, N = 1513677.7611521.381. (CXX) g++ options: -O3 -std=c++11 -fopenmp
OpenBenchmarking.orgTest Cases Per Minute, More Is BetterDarmstadt Automotive Parallel Heterogeneous SuiteBackend: OpenMP - Kernel: Points2ImageAVX-512 OnAVX-512 Off2K4K6K8K10KMin: 12686.22 / Avg: 13677.76 / Max: 14539.71Min: 9798.72 / Avg: 11521.38 / Max: 12893.441. (CXX) g++ options: -O3 -std=c++11 -fopenmp

Neural Magic DeepSparse

MinAvgMaxAVX-512 On32.443.751.5AVX-512 Off36.845.252.4OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1530456075

MinAvgMaxAVX-512 On38.649.457.0AVX-512 Off41.851.560.0OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1632486480

MinAvgMaxAVX-512 On39.048.052.4AVX-512 Off43.050.554.3OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1530456075

MinAvgMaxAVX-512 On39.951.255.5AVX-512 Off43.954.459.1OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1632486480

MinAvgMaxAVX-512 On38.151.957.3AVX-512 Off44.454.959.1OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1632486480

MinAvgMaxAVX-512 On38.947.853.8AVX-512 Off43.950.755.5OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1530456075

MinAvgMaxAVX-512 On38.148.554.8AVX-512 Off43.050.656.9OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1632486480

MinAvgMaxAVX-512 On39.351.958.3AVX-512 Off43.551.559.1OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1632486480

MinAvgMaxAVX-512 On39.850.154.6AVX-512 Off44.452.155.6OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1530456075

MinAvgMaxAVX-512 On40.152.957.9AVX-512 Off44.954.461.8OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On40.648.754.3AVX-512 Off43.550.756.9OpenBenchmarking.orgCelsius, Fewer Is BetterNeural Magic DeepSparse 1.1CPU Temperature Monitor1632486480

OSPRay Studio

MinAvgMaxAVX-512 On36.456.362.3AVX-512 Off41.858.964.5OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.558.465.0AVX-512 Off46.660.764.9OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.059.064.0AVX-512 Off46.661.368.0OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.157.962.3AVX-512 Off48.060.264.0OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.859.764.9AVX-512 Off46.662.065.4OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.960.365.4AVX-512 Off47.560.366.3OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.957.761.8AVX-512 Off47.059.964.0OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.858.462.6AVX-512 Off46.660.664.9OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.858.963.1AVX-512 Off46.661.365.8OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.457.061.4AVX-512 Off46.659.964.0OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.559.764.0AVX-512 Off46.161.665.4OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On45.359.864.9AVX-512 Off47.559.964.1OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.158.462.3AVX-512 Off46.660.364.5OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.858.765.4AVX-512 Off47.060.864.9OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.859.364.0AVX-512 Off46.661.366.3OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.958.262.8AVX-512 Off46.660.064.0OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.659.964.9AVX-512 Off46.661.664.9OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.657.962.6AVX-512 Off47.060.865.4OpenBenchmarking.orgCelsius, Fewer Is BetterOSPRay Studio 0.11CPU Temperature Monitor20406080100

oneDNN

MinAvgMaxAVX-512 On40.946.051.1AVX-512 Off43.547.753.8OpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 2.7CPU Temperature Monitor1530456075

MinAvgMaxAVX-512 On37.343.546.1AVX-512 Off39.543.847.0OpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 2.7CPU Temperature Monitor1428425670

MinAvgMaxAVX-512 On35.939.344.4AVX-512 Off39.041.443.9OpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 2.7CPU Temperature Monitor1224364860

MinAvgMaxAVX-512 On35.546.050.6AVX-512 Off38.645.950.1OpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 2.7CPU Temperature Monitor1428425670

OpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 2.7CPU Temperature MonitorAVX-512 OnAVX-512 Off1020304050Min: 37.75 / Avg: 46.67 / Max: 51.5Min: 42.63 / Avg: 46.63 / Max: 50.63

MinAvgMaxAVX-512 On37.845.848.9AVX-512 Off42.646.849.8OpenBenchmarking.orgCelsius, Fewer Is BetteroneDNN 2.7CPU Temperature Monitor1428425670

Mobile Neural Network

OpenBenchmarking.orgCelsius, Fewer Is BetterMobile Neural Network 2.1CPU Temperature MonitorAVX-512 OnAVX-512 Off1122334455Min: 35.88 / Avg: 48.36 / Max: 52Min: 40.38 / Avg: 49.52 / Max: 52.88

NCNN

OpenBenchmarking.orgCelsius, Fewer Is BetterNCNN 20220729CPU Temperature MonitorAVX-512 OnAVX-512 Off1020304050Min: 38.63 / Avg: 44.45 / Max: 48Min: 43 / Avg: 46.1 / Max: 50.13

OpenVINO

MinAvgMaxAVX-512 On35.957.364.1AVX-512 Off41.356.261.9OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.058.161.9AVX-512 Off47.062.566.8OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On42.155.459.5AVX-512 Off49.359.962.6OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On41.852.255.1AVX-512 Off48.059.262.3OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On39.557.465.9AVX-512 Off47.058.267.6OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On42.158.067.3AVX-512 Off46.158.766.8OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On42.659.462.8AVX-512 Off45.863.067.1OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.060.964.5AVX-512 Off49.860.061.8OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On44.457.960.1AVX-512 Off48.061.164.0OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On42.159.062.4AVX-512 Off48.959.962.3OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On43.856.859.6AVX-512 Off46.658.061.4OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On42.658.461.9AVX-512 Off46.156.361.4OpenBenchmarking.orgCelsius, Fewer Is BetterOpenVINO 2022.2.devCPU Temperature Monitor20406080100

Numenta Anomaly Benchmark

MinAvgMaxAVX-512 On32.440.450.1AVX-512 Off38.643.249.9OpenBenchmarking.orgCelsius, Fewer Is BetterNumenta Anomaly Benchmark 1.1CPU Temperature Monitor1428425670

MinAvgMaxAVX-512 On32.439.253.3AVX-512 Off37.842.860.4OpenBenchmarking.orgCelsius, Fewer Is BetterNumenta Anomaly Benchmark 1.1CPU Temperature Monitor20406080100

MinAvgMaxAVX-512 On31.538.847.0AVX-512 Off37.841.550.4OpenBenchmarking.orgCelsius, Fewer Is BetterNumenta Anomaly Benchmark 1.1CPU Temperature Monitor1428425670

OpenFOAM

MinAvgMaxAVX-512 On41.362.668.5AVX-512 Off47.064.970.3OpenBenchmarking.orgCelsius, Fewer Is BetterOpenFOAM 10CPU Temperature Monitor20406080100

CP2K Molecular Dynamics

MinAvgMaxAVX-512 On42.144.553.5AVX-512 Off42.144.456.9OpenBenchmarking.orgCelsius, Fewer Is BetterCP2K Molecular Dynamics 8.2CPU Temperature Monitor1632486480

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off4080120160200SE +/- 0.33, N = 3SE +/- 0.33, N = 3148181-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off306090120150Min: 148 / Avg: 148.33 / Max: 149Min: 180 / Avg: 180.67 / Max: 1811. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off6001200180024003000SE +/- 2.33, N = 3SE +/- 1.20, N = 323532871-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off5001000150020002500Min: 2349 / Avg: 2352.67 / Max: 2357Min: 2869 / Avg: 2870.67 / Max: 28731. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off12002400360048006000SE +/- 3.71, N = 3SE +/- 6.56, N = 346985749-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off10002000300040005000Min: 4691 / Avg: 4698.33 / Max: 4703Min: 5741 / Avg: 5749 / Max: 57621. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off150300450600750SE +/- 1.00, N = 3SE +/- 2.08, N = 3581710-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off120240360480600Min: 580 / Avg: 581 / Max: 583Min: 706 / Avg: 710 / Max: 7131. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 10.17, N = 3SE +/- 24.46, N = 3927111321-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off2K4K6K8K10KMin: 9251 / Avg: 9271.33 / Max: 9282Min: 11288 / Avg: 11321.33 / Max: 113691. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off5K10K15K20K25KSE +/- 13.75, N = 3SE +/- 27.41, N = 31854622647-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 1 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off4K8K12K16K20KMin: 18519 / Avg: 18546 / Max: 18564Min: 22609 / Avg: 22646.67 / Max: 227001. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off4080120160200SE +/- 0.00, N = 3SE +/- 0.33, N = 3154184-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off306090120150Min: 154 / Avg: 154 / Max: 154Min: 184 / Avg: 184.33 / Max: 1851. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off6001200180024003000SE +/- 9.17, N = 3SE +/- 0.88, N = 323842924-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off5001000150020002500Min: 2372 / Avg: 2384 / Max: 2402Min: 2923 / Avg: 2924.33 / Max: 29261. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off13002600390052006500SE +/- 6.24, N = 3SE +/- 8.25, N = 347645861-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off10002000300040005000Min: 4755 / Avg: 4764 / Max: 4776Min: 5849 / Avg: 5861.33 / Max: 58771. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off160320480640800SE +/- 1.00, N = 3SE +/- 0.67, N = 3604722-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off130260390520650Min: 603 / Avg: 604 / Max: 606Min: 721 / Avg: 721.67 / Max: 7231. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off2K4K6K8K10KSE +/- 2.19, N = 3SE +/- 15.51, N = 3938511524-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off2K4K6K8K10KMin: 9381 / Avg: 9385.33 / Max: 9388Min: 11505 / Avg: 11524.33 / Max: 115551. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off5K10K15K20K25KSE +/- 20.34, N = 3SE +/- 30.02, N = 31880822985-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 2 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off4K8K12K16K20KMin: 18779 / Avg: 18807.67 / Max: 18847Min: 22942 / Avg: 22985.33 / Max: 230431. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off50100150200250SE +/- 0.00, N = 3SE +/- 0.58, N = 3177214-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off4080120160200Min: 177 / Avg: 177 / Max: 177Min: 213 / Avg: 214 / Max: 2151. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off7001400210028003500SE +/- 3.06, N = 3SE +/- 7.00, N = 328043413-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off6001200180024003000Min: 2798 / Avg: 2804 / Max: 2808Min: 3400 / Avg: 3413 / Max: 34241. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off15003000450060007500SE +/- 9.54, N = 3SE +/- 5.51, N = 356086820-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off12002400360048006000Min: 5592 / Avg: 5608 / Max: 5625Min: 6810 / Avg: 6820 / Max: 68291. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off2004006008001000SE +/- 1.53, N = 3SE +/- 1.15, N = 3694841-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path TracerAVX-512 OnAVX-512 Off150300450600750Min: 692 / Avg: 694 / Max: 697Min: 839 / Avg: 841 / Max: 8431. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off3K6K9K12K15KSE +/- 27.14, N = 3SE +/- 22.81, N = 31107013398-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 16 - Renderer: Path TracerAVX-512 OnAVX-512 Off2K4K6K8K10KMin: 11023 / Avg: 11069.67 / Max: 11117Min: 13355 / Avg: 13397.67 / Max: 134331. (CXX) g++ options: -O3 -march=native -ldl

OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off6K12K18K24K30KSE +/- 21.63, N = 3SE +/- 15.62, N = 32218426785-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi-mno-avx512f1. (CXX) g++ options: -O3 -march=native -ldl
OpenBenchmarking.orgms, Fewer Is BetterOSPRay Studio 0.11Camera: 3 - Resolution: 4K - Samples Per Pixel: 32 - Renderer: Path TracerAVX-512 OnAVX-512 Off5K10K15K20K25KMin: 22142 / Avg: 22184 / Max: 22214Min: 26763 / Avg: 26784.67 / Max: 268151. (CXX) g++ options: -O3 -march=native -ldl

oneDNN

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off0.1210.2420.3630.4840.605SE +/- 0.004067, N = 7SE +/- 0.004200, N = 150.5161670.537682-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 0.42-mno-avx512f - MIN: 0.421. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Convolution Batch Shapes Auto - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off246810Min: 0.51 / Avg: 0.52 / Max: 0.53Min: 0.52 / Avg: 0.54 / Max: 0.561. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off510152025SE +/- 0.16, N = 3SE +/- 0.06, N = 322.5922.84-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 19.6-mno-avx512f - MIN: 20.221. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Deconvolution Batch shapes_1d - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off510152025Min: 22.37 / Avg: 22.59 / Max: 22.91Min: 22.74 / Avg: 22.84 / Max: 22.941. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off0.39680.79361.19041.58721.984SE +/- 0.001906, N = 9SE +/- 0.004550, N = 90.9895911.763360-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 0.9-mno-avx512f - MIN: 1.551. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Deconvolution Batch shapes_3d - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off246810Min: 0.98 / Avg: 0.99 / Max: 1Min: 1.75 / Avg: 1.76 / Max: 1.791. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off11002200330044005500SE +/- 20.13, N = 5SE +/- 60.12, N = 41962.064953.98-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 1899.34-mno-avx512f - MIN: 4690.851. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Recurrent Neural Network Training - Data Type: f32 - Engine: CPUAVX-512 OnAVX-512 Off9001800270036004500Min: 1925.73 / Avg: 1962.06 / Max: 2038.46Min: 4839.02 / Avg: 4953.98 / Max: 5101.721. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUAVX-512 OnAVX-512 Off11002200330044005500SE +/- 19.67, N = 15SE +/- 41.71, N = 31955.394989.13-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 1769.7-mno-avx512f - MIN: 4767.221. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPUAVX-512 OnAVX-512 Off9001800270036004500Min: 1795.46 / Avg: 1955.39 / Max: 2068.28Min: 4907.59 / Avg: 4989.13 / Max: 5045.151. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUAVX-512 OnAVX-512 Off5001000150020002500SE +/- 24.41, N = 4SE +/- 22.32, N = 32361.271715.76-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 2265.7-mno-avx512f - MIN: 1607.131. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread
OpenBenchmarking.orgms, Fewer Is BetteroneDNN 2.7Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPUAVX-512 OnAVX-512 Off400800120016002000Min: 2289.94 / Avg: 2361.27 / Max: 2399.91Min: 1672.8 / Avg: 1715.76 / Max: 1747.751. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Mobile Neural Network

MNN is the Mobile Neural Network as a highly efficient, lightweight deep learning framework developed by Alibaba. This MNN test profile is building the OpenMP / CPU threaded version for processor benchmarking and not any GPU-accelerated test. MNN does allow making use of AVX-512 extensions. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: resnet-v2-50AVX-512 OnAVX-512 Off612182430SE +/- 0.08, N = 9SE +/- 0.08, N = 815.4424.10-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 14.79 / MAX: 54.05-mno-avx512f - MIN: 23.44 / MAX: 71.321. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: resnet-v2-50AVX-512 OnAVX-512 Off612182430Min: 15.1 / Avg: 15.44 / Max: 15.77Min: 23.78 / Avg: 24.1 / Max: 24.431. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: SqueezeNetV1.0AVX-512 OnAVX-512 Off3691215SE +/- 0.147, N = 9SE +/- 0.091, N = 88.5799.146-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 6.67 / MAX: 21.5-mno-avx512f - MIN: 7.72 / MAX: 19.031. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: SqueezeNetV1.0AVX-512 OnAVX-512 Off3691215Min: 8.11 / Avg: 8.58 / Max: 9.2Min: 8.64 / Avg: 9.15 / Max: 9.361. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: inception-v3AVX-512 OnAVX-512 Off1020304050SE +/- 0.23, N = 9SE +/- 0.13, N = 845.8130.44-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 44.13 / MAX: 87.28-mno-avx512f - MIN: 28.75 / MAX: 109.221. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl
OpenBenchmarking.orgms, Fewer Is BetterMobile Neural Network 2.1Model: inception-v3AVX-512 OnAVX-512 Off918273645Min: 44.44 / Avg: 45.81 / Max: 46.69Min: 30.06 / Avg: 30.44 / Max: 30.971. (CXX) g++ options: -O3 -march=native -std=c++11 -fvisibility=hidden -fomit-frame-pointer -fstrict-aliasing -ffunction-sections -fdata-sections -ffast-math -fno-rtti -fno-exceptions -rdynamic -pthread -ldl

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetAVX-512 OnAVX-512 Off1020304050SE +/- 0.26, N = 8SE +/- 0.49, N = 342.3044.19-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 39.64 / MAX: 571.78-mno-avx512f - MIN: 41.56 / MAX: 148.981. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: mnasnetAVX-512 OnAVX-512 Off918273645Min: 41.23 / Avg: 42.3 / Max: 43.32Min: 43.25 / Avg: 44.19 / Max: 44.891. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0AVX-512 OnAVX-512 Off1428425670SE +/- 0.35, N = 8SE +/- 0.31, N = 357.7162.86-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 54.52 / MAX: 522.74-mno-avx512f - MIN: 59.46 / MAX: 154.821. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: efficientnet-b0AVX-512 OnAVX-512 Off1224364860Min: 56.12 / Avg: 57.71 / Max: 59.27Min: 62.41 / Avg: 62.86 / Max: 63.451. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceAVX-512 OnAVX-512 Off714212835SE +/- 0.18, N = 8SE +/- 0.54, N = 325.8829.00-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 24.45 / MAX: 112.37-mno-avx512f - MIN: 26.03 / MAX: 144.121. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: blazefaceAVX-512 OnAVX-512 Off612182430Min: 25.05 / Avg: 25.88 / Max: 26.41Min: 27.97 / Avg: 29 / Max: 29.791. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetAVX-512 OnAVX-512 Off20406080100SE +/- 0.76, N = 8SE +/- 1.35, N = 372.8076.13-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 67.13 / MAX: 388.52-mno-avx512f - MIN: 70.13 / MAX: 155.611. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: googlenetAVX-512 OnAVX-512 Off1530456075Min: 69.82 / Avg: 72.8 / Max: 75.89Min: 73.52 / Avg: 76.13 / Max: 78.011. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50AVX-512 OnAVX-512 Off1530456075SE +/- 0.50, N = 8SE +/- 0.89, N = 366.3467.48-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 62.92 / MAX: 194.74-mno-avx512f - MIN: 63.4 / MAX: 170.141. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: resnet50AVX-512 OnAVX-512 Off1326395265Min: 64.88 / Avg: 66.34 / Max: 69.42Min: 65.93 / Avg: 67.48 / Max: 69.011. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mAVX-512 OnAVX-512 Off60120180240300SE +/- 2.25, N = 8SE +/- 3.75, N = 3247.32270.12-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 232.61 / MAX: 506.55-mno-avx512f - MIN: 245.47 / MAX: 498.81. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: regnety_400mAVX-512 OnAVX-512 Off50100150200250Min: 238.55 / Avg: 247.32 / Max: 256.84Min: 265.02 / Avg: 270.12 / Max: 277.441. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerAVX-512 OnAVX-512 Off20406080100SE +/- 1.59, N = 8SE +/- 5.82, N = 374.9386.16-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 65.04 / MAX: 2154.62-mno-avx512f - MIN: 73.72 / MAX: 1760.741. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: vision_transformerAVX-512 OnAVX-512 Off1632486480Min: 70.5 / Avg: 74.93 / Max: 82.12Min: 76.86 / Avg: 86.16 / Max: 96.881. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetAVX-512 OnAVX-512 Off1428425670SE +/- 0.49, N = 8SE +/- 2.90, N = 358.9560.73-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 55.11 / MAX: 282.72-mno-avx512f - MIN: 52.89 / MAX: 236.851. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread
OpenBenchmarking.orgms, Fewer Is BetterNCNN 20220729Target: CPU - Model: FastestDetAVX-512 OnAVX-512 Off1224364860Min: 57.5 / Avg: 58.95 / Max: 61.6Min: 54.93 / Avg: 60.73 / Max: 63.841. (CXX) g++ options: -O3 -march=native -rdynamic -lgomp -lpthread

OpenVINO

This is a test of the Intel OpenVINO, a toolkit around neural networks, using its built-in benchmarking support and analyzing the throughput and latency for various models. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off2004006008001000SE +/- 0.24, N = 3SE +/- 0.51, N = 3469.291088.50-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 410.93 / MAX: 547.33-mno-avx512f - MIN: 937.5 / MAX: 1231.551. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Face Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off2004006008001000Min: 468.86 / Avg: 469.29 / Max: 469.69Min: 1087.68 / Avg: 1088.5 / Max: 1089.421. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off110220330440550SE +/- 0.01, N = 3SE +/- 0.07, N = 3246.98503.50-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 204.5 / MAX: 299.54-mno-avx512f - MIN: 402.47 / MAX: 569.651. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Face Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off90180270360450Min: 246.97 / Avg: 246.98 / Max: 247Min: 503.36 / Avg: 503.5 / Max: 503.591. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OnAVX-512 Off0.27230.54460.81691.08921.3615SE +/- 0.00, N = 3SE +/- 0.01, N = 40.551.21-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 0.5 / MAX: 29.57-mno-avx512f - MIN: 0.98 / MAX: 47.371. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16 - Device: CPUAVX-512 OnAVX-512 Off246810Min: 0.55 / Avg: 0.55 / Max: 0.55Min: 1.19 / Avg: 1.21 / Max: 1.221. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off0.12830.25660.38490.51320.6415SE +/- 0.00, N = 3SE +/- 0.00, N = 30.360.57-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 0.34 / MAX: 33.7-mno-avx512f - MIN: 0.52 / MAX: 47.931. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off246810Min: 0.36 / Avg: 0.36 / Max: 0.36Min: 0.57 / Avg: 0.57 / Max: 0.571. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off400800120016002000SE +/- 4.83, N = 3SE +/- 15.65, N = 31100.251864.93-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 783.9 / MAX: 1824.92-mno-avx512f - MIN: 1371.29 / MAX: 2532.741. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Person Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off30060090012001500Min: 1092.24 / Avg: 1100.25 / Max: 1108.94Min: 1839.4 / Avg: 1864.93 / Max: 1893.371. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OnAVX-512 Off400800120016002000SE +/- 4.44, N = 3SE +/- 3.32, N = 31101.001845.95-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 840.82 / MAX: 1792.51-mno-avx512f - MIN: 1386.31 / MAX: 27981. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Person Detection FP32 - Device: CPUAVX-512 OnAVX-512 Off30060090012001500Min: 1092.78 / Avg: 1101 / Max: 1108.03Min: 1839.47 / Avg: 1845.95 / Max: 1850.411. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 39.6319.69-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 8.29 / MAX: 57.53-mno-avx512f - MIN: 16.4 / MAX: 74.851. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off510152025Min: 9.62 / Avg: 9.63 / Max: 9.64Min: 19.67 / Avg: 19.69 / Max: 19.711. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off3691215SE +/- 0.01, N = 3SE +/- 0.01, N = 34.7911.66-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 3.98 / MAX: 28.79-mno-avx512f - MIN: 9.72 / MAX: 43.71. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Weld Porosity Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off3691215Min: 4.78 / Avg: 4.79 / Max: 4.8Min: 11.64 / Avg: 11.66 / Max: 11.681. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off246810SE +/- 0.00, N = 3SE +/- 0.00, N = 34.287.90-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 3.5 / MAX: 42.62-mno-avx512f - MIN: 6.37 / MAX: 41.531. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16-INT8 - Device: CPUAVX-512 OnAVX-512 Off3691215Min: 4.27 / Avg: 4.28 / Max: 4.28Min: 7.9 / Avg: 7.9 / Max: 7.911. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off3691215SE +/- 0.00, N = 3SE +/- 0.03, N = 36.4312.67-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 5.22 / MAX: 61.26-mno-avx512f - MIN: 9.58 / MAX: 77.911. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Vehicle Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off48121620Min: 6.42 / Avg: 6.43 / Max: 6.43Min: 12.64 / Avg: 12.67 / Max: 12.741. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off3691215SE +/- 0.00, N = 3SE +/- 0.00, N = 35.299.33-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 4.36 / MAX: 45.25-mno-avx512f - MIN: 7.39 / MAX: 57.571. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Person Vehicle Bike Detection FP16 - Device: CPUAVX-512 OnAVX-512 Off3691215Min: 5.28 / Avg: 5.29 / Max: 5.29Min: 9.33 / Avg: 9.33 / Max: 9.341. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OnAVX-512 Off20406080100SE +/- 0.21, N = 3SE +/- 0.74, N = 350.11107.62-mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi - MIN: 38.98 / MAX: 166.69-mno-avx512f - MIN: 83.48 / MAX: 216.071. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared
OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.2.devModel: Machine Translation EN To DE FP16 - Device: CPUAVX-512 OnAVX-512 Off20406080100Min: 49.85 / Avg: 50.11 / Max: 50.52Min: 106.87 / Avg: 107.62 / Max: 109.11. (CXX) g++ options: -fPIC -O3 -march=native -fsigned-char -ffunction-sections -fdata-sections -fno-strict-overflow -fwrapv -flto -shared

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off3691215SE +/- 0.06, N = 3SE +/- 0.05, N = 311.7613.59
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off48121620Min: 11.64 / Avg: 11.76 / Max: 11.84Min: 13.52 / Avg: 13.59 / Max: 13.69

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off4080120160200SE +/- 0.47, N = 3SE +/- 0.10, N = 3155.35187.85
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off306090120150Min: 154.62 / Avg: 155.35 / Max: 156.24Min: 187.71 / Avg: 187.85 / Max: 188.05

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off1.26632.53263.79895.06526.3315SE +/- 0.0031, N = 3SE +/- 0.0091, N = 35.34135.6282
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off246810Min: 5.34 / Avg: 5.34 / Max: 5.35Min: 5.61 / Avg: 5.63 / Max: 5.64

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off20406080100SE +/- 0.14, N = 3SE +/- 0.05, N = 380.0895.33
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off20406080100Min: 79.89 / Avg: 80.08 / Max: 80.37Min: 95.24 / Avg: 95.33 / Max: 95.43

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off1530456075SE +/- 0.11, N = 3SE +/- 0.02, N = 349.0367.91
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off1326395265Min: 48.83 / Avg: 49.03 / Max: 49.19Min: 67.88 / Avg: 67.91 / Max: 67.93

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off816243240SE +/- 0.03, N = 3SE +/- 0.04, N = 328.9632.66
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off714212835Min: 28.93 / Avg: 28.96 / Max: 29.02Min: 32.6 / Avg: 32.66 / Max: 32.73

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off48121620SE +/- 0.0172, N = 3SE +/- 0.0334, N = 39.940814.0655
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off48121620Min: 9.91 / Avg: 9.94 / Max: 9.96Min: 14.03 / Avg: 14.07 / Max: 14.13

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off4080120160200SE +/- 0.11, N = 3SE +/- 0.23, N = 3125.79195.17
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off4080120160200Min: 125.56 / Avg: 125.79 / Max: 125.91Min: 194.9 / Avg: 195.17 / Max: 195.63

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off246810SE +/- 0.0061, N = 3SE +/- 0.0068, N = 35.27126.3313
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off3691215Min: 5.26 / Avg: 5.27 / Max: 5.28Min: 6.32 / Avg: 6.33 / Max: 6.34

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off306090120150SE +/- 0.12, N = 3SE +/- 0.25, N = 3111.73138.59
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-StreamAVX-512 OnAVX-512 Off306090120150Min: 111.56 / Avg: 111.73 / Max: 111.95Min: 138.32 / Avg: 138.59 / Max: 139.08

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off816243240SE +/- 0.07, N = 3SE +/- 0.11, N = 329.1932.75
OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-StreamAVX-512 OnAVX-512 Off714212835Min: 29.06 / Avg: 29.19 / Max: 29.29Min: 32.59 / Avg: 32.75 / Max: 32.97

Numenta Anomaly Benchmark

Numenta Anomaly Benchmark (NAB) is a benchmark for evaluating algorithms for anomaly detection in streaming, real-time applications. It is comprised of over 50 labeled real-world and artificial time-series data files plus a novel scoring mechanism designed for real-time applications. This test profile currently measures the time to run various detectors. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Bayesian ChangepointAVX-512 OnAVX-512 Off48121620SE +/- 0.21, N = 4SE +/- 0.19, N = 516.6817.39
OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Bayesian ChangepointAVX-512 OnAVX-512 Off48121620Min: 16.36 / Avg: 16.68 / Max: 17.27Min: 17.11 / Avg: 17.39 / Max: 18.05

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Windowed GaussianAVX-512 OnAVX-512 Off1.09852.1973.29554.3945.4925SE +/- 0.032, N = 15SE +/- 0.044, N = 74.7274.882
OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Windowed GaussianAVX-512 OnAVX-512 Off246810Min: 4.55 / Avg: 4.73 / Max: 5.01Min: 4.78 / Avg: 4.88 / Max: 5.05

OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Relative EntropyAVX-512 OnAVX-512 Off3691215SE +/- 0.089, N = 5SE +/- 0.082, N = 59.89810.035
OpenBenchmarking.orgSeconds, Fewer Is BetterNumenta Anomaly Benchmark 1.1Detector: Relative EntropyAVX-512 OnAVX-512 Off3691215Min: 9.78 / Avg: 9.9 / Max: 10.25Min: 9.82 / Avg: 10.04 / Max: 10.24

OpenFOAM

OpenFOAM is the leading free, open-source software for computational fluid dynamics (CFD). This test profile currently uses the drivaerFastback test case for analyzing automotive aerodynamics or alternatively the older motorBike input. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Mesh TimeAVX-512 OnAVX-512 Off306090120150135.77144.351. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution TimeAVX-512 OnAVX-512 Off306090120150113.64135.221. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

CP2K Molecular Dynamics

CP2K is an open-source molecular dynamics software package focused on quantum chemistry and solid-state physics. This test profile currently uses the SSMP (OpenMP) version of cp2k. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterCP2K Molecular Dynamics 8.2Input: Fayalite-FISTAVX-512 OnAVX-512 Off300600900120015001122.731198.41

266 Results Shown

CPU Temperature Monitor:
  Phoronix Test Suite System Monitoring:
    Celsius
    Watts
miniBUDE:
  OpenMP - BM1
  OpenMP - BM2
OpenVINO:
  Face Detection FP16 - CPU
  Face Detection FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16 - CPU
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
  Person Detection FP16 - CPU
  Person Detection FP32 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Weld Porosity Detection FP16 - CPU
  Vehicle Detection FP16-INT8 - CPU
  Vehicle Detection FP16 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Machine Translation EN To DE FP16 - CPU
Embree
Embree
Embree:
  Pathtracer ISPC - Asian Dragon
  Pathtracer ISPC - Crown
SVT-AV1:
  Preset 13 - Bosphorus 4K
  Preset 12 - Bosphorus 4K
simdjson
simdjson:
  PartialTweets
  LargeRand
  Kostya
  DistinctUserID
  TopTweet
miniBUDE:
  OpenMP - BM1
  OpenMP - BM2
Intel Open Image Denoise
Intel Open Image Denoise:
  RT.ldr_alb_nrm.3840x2160
  RTLightmap.hdr.4096x4096
Intel Open Image Denoise:
  RT.ldr_alb_nrm.3840x2160
  RTLightmap.hdr.4096x4096
TensorFlow:
  CPU - 16 - ResNet-50
  CPU - 16 - AlexNet
  CPU - 16 - GoogLeNet
ONNX Runtime:
  fcn-resnet101-11 - CPU - Standard
  super-resolution-10 - CPU - Standard
  ArcFace ResNet-100 - CPU - Standard
ONNX Runtime:
  fcn-resnet101-11 - CPU - Standard
  super-resolution-10 - CPU - Standard
ONNX Runtime
ONNX Runtime:
  bertsquad-12 - CPU - Standard
  ArcFace ResNet-100 - CPU - Standard
OpenVKL
OSPRay:
  gravity_spheres_volume/dim_512/ao/real_time
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/pathtracer/real_time
Neural Magic DeepSparse:
  NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Stream
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream
  NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Stream
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream
  CV Detection,YOLOv5s COCO - Synchronous Single-Stream
  CV Detection,YOLOv5s COCO - Asynchronous Multi-Stream
  NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream
Cpuminer-Opt:
  scrypt
  Quad SHA-256, Pyrite
  x25x
  Garlicoin
Cpuminer-Opt:
  Skeincoin
  LBC, LBRY Credits
Cpuminer-Opt:
  Skeincoin
  LBC, LBRY Credits
AI Benchmark Alpha:
  CPU Peak Freq (Highest CPU Core Frequency) Monitor:
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
    Megahertz
SMHasher
JPEG XL libjxl:
  JPEG - 90
  JPEG - 100
  PNG - 90
LeelaChessZero:
  BLAS
  Eigen
GROMACS
AI Benchmark Alpha
AI Benchmark Alpha:
  Device Inference Score
  Device Training Score
Numpy Benchmark
AI Benchmark Alpha
Numpy Benchmark
Darmstadt Automotive Parallel Heterogeneous Suite:
  OpenMP - NDT Mapping
  OpenMP - Points2Image
Neural Magic DeepSparse:
  CPU Temp Monitor:
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
    Celsius
OSPRay Studio:
  1 - 1080p - 1 - Path Tracer
  1 - 1080p - 16 - Path Tracer
  1 - 1080p - 32 - Path Tracer
  1 - 4K - 1 - Path Tracer
  1 - 4K - 16 - Path Tracer
  1 - 4K - 32 - Path Tracer
  2 - 1080p - 1 - Path Tracer
  2 - 1080p - 16 - Path Tracer
  2 - 1080p - 32 - Path Tracer
  2 - 4K - 1 - Path Tracer
  2 - 4K - 16 - Path Tracer
  2 - 4K - 32 - Path Tracer
  3 - 1080p - 1 - Path Tracer
  3 - 1080p - 16 - Path Tracer
  3 - 1080p - 32 - Path Tracer
  3 - 4K - 1 - Path Tracer
  3 - 4K - 16 - Path Tracer
  3 - 4K - 32 - Path Tracer
oneDNN:
  Convolution Batch Shapes Auto - f32 - CPU
  Deconvolution Batch shapes_1d - f32 - CPU
  Deconvolution Batch shapes_3d - f32 - CPU
  Recurrent Neural Network Training - f32 - CPU
  Recurrent Neural Network Training - bf16bf16bf16 - CPU
  Recurrent Neural Network Inference - bf16bf16bf16 - CPU
Mobile Neural Network:
  resnet-v2-50
  SqueezeNetV1.0
  inception-v3
NCNN:
  CPU - mnasnet
  CPU - efficientnet-b0
  CPU - blazeface
  CPU - googlenet
  CPU - resnet50
  CPU - regnety_400m
  CPU - vision_transformer
  CPU - FastestDet
OpenVINO:
  Face Detection FP16 - CPU
  Face Detection FP16-INT8 - CPU
  Age Gender Recognition Retail 0013 FP16 - CPU
  Age Gender Recognition Retail 0013 FP16-INT8 - CPU
  Person Detection FP16 - CPU
  Person Detection FP32 - CPU
  Weld Porosity Detection FP16-INT8 - CPU
  Weld Porosity Detection FP16 - CPU
  Vehicle Detection FP16-INT8 - CPU
  Vehicle Detection FP16 - CPU
  Person Vehicle Bike Detection FP16 - CPU
  Machine Translation EN To DE FP16 - CPU
Neural Magic DeepSparse:
  NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Stream
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream
  NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream
  NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Stream
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream
  CV Detection,YOLOv5s COCO - Synchronous Single-Stream
  CV Detection,YOLOv5s COCO - Asynchronous Multi-Stream
  NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream
Numenta Anomaly Benchmark:
  Bayesian Changepoint
  Windowed Gaussian
  Relative Entropy
OpenFOAM:
  drivaerFastback, Medium Mesh Size - Mesh Time
  drivaerFastback, Medium Mesh Size - Execution Time
CP2K Molecular Dynamics