AMD EPYC 4th Gen AVX-512 Comparison

AMD EPYC 9654 Genoa AVX-512 benchmark comparison by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212195-NE-AVXCOMPAR69

Run Management

AVX-512 On: December 18 2022, Test Duration 20 Hours, 2 Minutes
AVX-512 Off: December 18 2022, Test Duration 15 Hours, 29 Minutes
Average Test Duration: 17 Hours, 45 Minutes


Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)
Motherboard: AMD Titanite_4G (RTI1002E BIOS)
Chipset: AMD Device 14a4
Memory: 1520GB
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VGA HDMI
Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 22.10
Kernel: 6.1.0-phx (x86_64)
Desktop: GNOME Shell 43.0
Display Server: X Server 1.21.1.4
Vulkan: 1.3.224
Compiler: GCC 12.2.0 + Clang 15.0.2-1
File-System: ext4
Screen Resolution: 1920x1080

System Logs

- Transparent Huge Pages: madvise
- AVX-512 On: CXXFLAGS="-O3 -march=native -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mprefer-vector-width=512" CFLAGS="-O3 -march=native -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi -mprefer-vector-width=512"
- AVX-512 Off: CXXFLAGS="-O3 -march=native -mno-avx512f" CFLAGS="-O3 -march=native -mno-avx512f"
- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v
- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10110d
- Python 3.10.7
- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
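
Before reproducing the AVX-512 On build, it may help to confirm that the host CPU advertises every extension named in those CFLAGS/CXXFLAGS. The following is a minimal sketch, assuming a Linux system where /proc/cpuinfo exposes a "flags" line; the helper name missing_avx512_flags and the sample string are hypothetical and not part of the Phoronix Test Suite.

```python
# Hypothetical helper (not part of the Phoronix Test Suite): report which of the
# AVX-512 extensions used by the "AVX-512 On" build are absent from a cpuinfo dump.
REQUIRED = (
    "avx512f", "avx512cd", "avx512vl", "avx512bw",
    "avx512dq", "avx512ifma", "avx512vbmi",
)

def missing_avx512_flags(cpuinfo_text):
    """Return the required flags that the first 'flags' line does not list."""
    for line in cpuinfo_text.splitlines():
        if line.startswith("flags"):
            present = set(line.split(":", 1)[1].split())
            return [flag for flag in REQUIRED if flag not in present]
    # No flags line found: treat every extension as missing.
    return list(REQUIRED)

# Made-up sample input; on a real Linux host one could instead pass
# open("/proc/cpuinfo").read().
sample = "flags\t\t: fpu sse2 avx2 avx512f avx512cd avx512bw"
print(missing_avx512_flags(sample))
```

An empty list would suggest that all seven -mavx512* options from the AVX-512 On configuration are usable on that machine.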

[Figure: AVX-512 On vs. AVX-512 Off Comparison (Phoronix Test Suite). Per-test gain of AVX-512 On over the AVX-512 Off baseline, ranging from 185.4% (TensorFlow: CPU - 16 - AlexNet) down to 2.7% (Darmstadt Automotive Parallel Heterogeneous Suite: OpenMP - NDT Mapping).]
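
The percentages in the comparison above appear to be the relative advantage of the AVX-512 On run over the AVX-512 Off baseline. The sketch below shows that arithmetic under that assumption, using two results from this file (TensorFlow AlexNet in images/sec and DeepSparse BERT SST2 single-stream in ms/batch); the function name relative_gain is hypothetical, and for "Fewer Is Better" metrics the ratio is assumed to be inverted.

```python
def relative_gain(on, off, more_is_better=True):
    """Percent advantage of the AVX-512 On result over the AVX-512 Off baseline."""
    # For "Fewer Is Better" metrics (e.g. ms/batch) a smaller value wins,
    # so the ratio is inverted (an assumption about how the page computes it).
    ratio = on / off if more_is_better else off / on
    return (ratio - 1.0) * 100.0

# TensorFlow 2.10, CPU - 16 - AlexNet (images/sec, More Is Better)
alexnet = relative_gain(157.29, 55.11)

# DeepSparse 1.1, BERT base uncased SST2, Synchronous Single-Stream
# (ms/batch, Fewer Is Better)
bert_latency = relative_gain(11.76, 13.59, more_is_better=False)

print(f"AlexNet: {alexnet:.1f}%")      # AlexNet: 185.4%
print(f"BERT SST2: {bert_latency:.1f}%")  # BERT SST2: 15.6%
```

Both values agree with the 185.4% and 15.6% entries in the comparison, which supports the assumed formula without confirming it.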

[Table: AMD EPYC 4th Gen AVX-512 Comparison, detailed system result table with one row per test (ai-benchmark, deepsparse, tensorflow, lczero, embree, openvkl, ospray, ospray-studio, onednn, mnn, cpuminer-opt, ncnn, openvino, gromacs, onnx, numpy, minibude, svt-av1, numenta-nab, oidn, daphne, simdjson, openfoam, cp2k, smhasher, jpegxl) and columns AVX-512 On and AVX-512 Off. OpenBenchmarking.org]

AI Benchmark Alpha

AI Benchmark Alpha is a Python library for evaluating artificial intelligence (AI) performance on diverse hardware platforms and relies upon the TensorFlow machine learning library. Learn more via the OpenBenchmarking.org test page.

AI Benchmark Alpha 0.1.2 - Device Inference Score (Score, More Is Better)
AVX-512 On: 3510
AVX-512 Off: 1821

AI Benchmark Alpha 0.1.2 - Device Training Score (Score, More Is Better)
AVX-512 On: 2701
AVX-512 Off: 1066

AI Benchmark Alpha 0.1.2 - Device AI Score (Score, More Is Better)
AVX-512 On: 6211
AVX-512 Off: 2887

Neural Magic DeepSparse

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Stream (items/sec, More Is Better)
AVX-512 On: 85.00 (SE +/- 0.45, N = 3; Min: 84.41 / Avg: 85 / Max: 85.88)
AVX-512 Off: 73.54 (SE +/- 0.27, N = 3; Min: 73.03 / Avg: 73.54 / Max: 73.92)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 11.76 (SE +/- 0.06, N = 3; Min: 11.64 / Avg: 11.76 / Max: 11.84)
AVX-512 Off: 13.59 (SE +/- 0.05, N = 3; Min: 13.52 / Avg: 13.59 / Max: 13.69)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
AVX-512 On: 616.68 (SE +/- 1.92, N = 3; Min: 613.23 / Avg: 616.68 / Max: 619.87)
AVX-512 Off: 509.45 (SE +/- 0.33, N = 3; Min: 508.96 / Avg: 509.45 / Max: 510.07)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 155.35 (SE +/- 0.47, N = 3; Min: 154.62 / Avg: 155.35 / Max: 156.24)
AVX-512 Off: 187.85 (SE +/- 0.10, N = 3; Min: 187.71 / Avg: 187.85 / Max: 188.05)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream (items/sec, More Is Better)
AVX-512 On: 187.08 (SE +/- 0.11, N = 3; Min: 186.88 / Avg: 187.08 / Max: 187.25)
AVX-512 Off: 177.54 (SE +/- 0.29, N = 3; Min: 177.08 / Avg: 177.54 / Max: 178.07)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 5.3413 (SE +/- 0.0031, N = 3; Min: 5.34 / Avg: 5.34 / Max: 5.35)
AVX-512 Off: 5.6282 (SE +/- 0.0091, N = 3; Min: 5.61 / Avg: 5.63 / Max: 5.64)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
AVX-512 On: 1195.94 (SE +/- 2.39, N = 3; Min: 1191.45 / Avg: 1195.94 / Max: 1199.62)
AVX-512 Off: 1005.16 (SE +/- 0.78, N = 3; Min: 1003.93 / Avg: 1005.16 / Max: 1006.6)

Neural Magic DeepSparse 1.1 - Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 80.08 (SE +/- 0.14, N = 3; Min: 79.89 / Avg: 80.08 / Max: 80.37)
AVX-512 Off: 95.33 (SE +/- 0.05, N = 3; Min: 95.24 / Avg: 95.33 / Max: 95.43)

Neural Magic DeepSparse 1.1 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
AVX-512 On: 1953.03 (SE +/- 4.32, N = 3; Min: 1946.33 / Avg: 1953.03 / Max: 1961.1)
AVX-512 Off: 1410.81 (SE +/- 0.17, N = 3; Min: 1410.51 / Avg: 1410.81 / Max: 1411.1)

Neural Magic DeepSparse 1.1 - Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 49.03 (SE +/- 0.11, N = 3; Min: 48.83 / Avg: 49.03 / Max: 49.19)
AVX-512 Off: 67.91 (SE +/- 0.02, N = 3; Min: 67.88 / Avg: 67.91 / Max: 67.93)

Neural Magic DeepSparse 1.1 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream (items/sec, More Is Better)
AVX-512 On: 34.52 (SE +/- 0.03, N = 3; Min: 34.45 / Avg: 34.52 / Max: 34.56)
AVX-512 Off: 30.61 (SE +/- 0.03, N = 3; Min: 30.54 / Avg: 30.61 / Max: 30.66)

Neural Magic DeepSparse 1.1 - Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 28.96 (SE +/- 0.03, N = 3; Min: 28.93 / Avg: 28.96 / Max: 29.02)
AVX-512 Off: 32.66 (SE +/- 0.04, N = 3; Min: 32.6 / Avg: 32.66 / Max: 32.73)

Neural Magic DeepSparse 1.1 - Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream (items/sec, More Is Better)
AVX-512 On: 100.55 (SE +/- 0.17, N = 3; Min: 100.37 / Avg: 100.55 / Max: 100.89)
AVX-512 Off: 71.04 (SE +/- 0.17, N = 3; Min: 70.71 / Avg: 71.04 / Max: 71.24)

Neural Magic DeepSparse 1.1 - Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 9.9408 (SE +/- 0.0172, N = 3; Min: 9.91 / Avg: 9.94 / Max: 9.96)
AVX-512 Off: 14.0655 (SE +/- 0.0334, N = 3; Min: 14.03 / Avg: 14.07 / Max: 14.13)

Neural Magic DeepSparse 1.1 - Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
AVX-512 On: 761.22 (SE +/- 0.79, N = 3; Min: 760.22 / Avg: 761.22 / Max: 762.79)
AVX-512 Off: 490.37 (SE +/- 0.61, N = 3; Min: 489.16 / Avg: 490.37 / Max: 491.14)

Neural Magic DeepSparse 1.1 - Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 125.79 (SE +/- 0.11, N = 3; Min: 125.56 / Avg: 125.79 / Max: 125.91)
AVX-512 Off: 195.17 (SE +/- 0.23, N = 3; Min: 194.9 / Avg: 195.17 / Max: 195.63)

Neural Magic DeepSparse 1.1 - Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream (items/sec, More Is Better)
AVX-512 On: 189.56 (SE +/- 0.22, N = 3; Min: 189.14 / Avg: 189.56 / Max: 189.86)
AVX-512 Off: 157.82 (SE +/- 0.17, N = 3; Min: 157.49 / Avg: 157.82 / Max: 158.08)

Neural Magic DeepSparse 1.1 - Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 5.2712 (SE +/- 0.0061, N = 3; Min: 5.26 / Avg: 5.27 / Max: 5.28)
AVX-512 Off: 6.3313 (SE +/- 0.0068, N = 3; Min: 6.32 / Avg: 6.33 / Max: 6.34)

Neural Magic DeepSparse 1.1 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (items/sec, More Is Better)
AVX-512 On: 857.23 (SE +/- 0.77, N = 3; Min: 855.84 / Avg: 857.23 / Max: 858.52)
AVX-512 Off: 690.87 (SE +/- 1.12, N = 3; Min: 688.64 / Avg: 690.87 / Max: 692.09)

Neural Magic DeepSparse 1.1 - Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 111.73 (SE +/- 0.12, N = 3; Min: 111.56 / Avg: 111.73 / Max: 111.95)
AVX-512 Off: 138.59 (SE +/- 0.25, N = 3; Min: 138.32 / Avg: 138.59 / Max: 139.08)

Neural Magic DeepSparse 1.1 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream (items/sec, More Is Better)
AVX-512 On: 34.25 (SE +/- 0.08, N = 3; Min: 34.13 / Avg: 34.25 / Max: 34.41)
AVX-512 Off: 30.53 (SE +/- 0.10, N = 3; Min: 30.33 / Avg: 30.53 / Max: 30.67)

Neural Magic DeepSparse 1.1 - Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream (ms/batch, Fewer Is Better)
AVX-512 On: 29.19 (SE +/- 0.07, N = 3; Min: 29.06 / Avg: 29.19 / Max: 29.29)
AVX-512 Off: 32.75 (SE +/- 0.11, N = 3; Min: 32.59 / Avg: 32.75 / Max: 32.97)
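
Each result above reports "SE +/- x, N = n" next to Min / Avg / Max values. As a rough sketch, assuming SE means the sample standard deviation divided by the square root of N (this is not confirmed by the result file), the figure can be cross-checked for N = 3 runs, where the unreported middle run follows from the average. The helper names below are hypothetical, and the inputs are the rounded values from the BERT base uncased SST2 single-stream items/sec result.

```python
import math
import statistics

def standard_error(samples):
    """Sample standard deviation divided by sqrt(N); an assumed definition of SE."""
    return statistics.stdev(samples) / math.sqrt(len(samples))

def middle_of_three(minimum, average, maximum):
    """Recover the unreported middle run; only valid when exactly 3 runs exist."""
    return 3 * average - minimum - maximum

# AVX-512 On, BERT base uncased SST2, Synchronous Single-Stream (items/sec)
low, avg, high = 84.41, 85.00, 85.88
runs = [low, middle_of_three(low, avg, high), high]
se = standard_error(runs)

print(f"reconstructed runs: {[round(r, 2) for r in runs]}")
print(f"SE +/- {se:.2f}, N = {len(runs)}")
```

With these rounded inputs the sketch gives roughly 0.45, in line with the reported "SE +/- 0.45, N = 3". Results with more than three runs (for example N = 12 or N = 15) cannot be reconstructed this way from Min / Avg / Max alone.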

TensorFlow

This is a benchmark of the TensorFlow deep learning framework using the TensorFlow reference benchmarks (tensorflow/benchmarks with tf_cnn_benchmarks.py). The Phoronix Test Suite also provides pts/tensorflow-lite for benchmarking the TensorFlow Lite binaries. Learn more via the OpenBenchmarking.org test page.

TensorFlow 2.10 - Device: CPU - Batch Size: 16 - Model: ResNet-50 (images/sec, More Is Better)
AVX-512 On: 22.15 (SE +/- 0.11, N = 3; Min: 21.93 / Avg: 22.15 / Max: 22.31)
AVX-512 Off: 12.75 (SE +/- 0.01, N = 3; Min: 12.72 / Avg: 12.75 / Max: 12.77)

TensorFlow 2.10 - Device: CPU - Batch Size: 16 - Model: AlexNet (images/sec, More Is Better)
AVX-512 On: 157.29 (SE +/- 2.35, N = 12; Min: 140.38 / Avg: 157.29 / Max: 168.09)
AVX-512 Off: 55.11 (SE +/- 0.19, N = 3; Min: 54.87 / Avg: 55.11 / Max: 55.49)

TensorFlow 2.10 - Device: CPU - Batch Size: 16 - Model: GoogLeNet (images/sec, More Is Better)
AVX-512 On: 60.17 (SE +/- 1.01, N = 15; Min: 54.32 / Avg: 60.17 / Max: 63.8)
AVX-512 Off: 31.12 (SE +/- 0.25, N = 3; Min: 30.8 / Avg: 31.12 / Max: 31.61)

LeelaChessZero

LeelaChessZero (lc0 / lczero) is a chess engine automated via neural networks. This test profile can be used for OpenCL, CUDA + cuDNN, and BLAS (CPU-based) benchmarking. Learn more via the OpenBenchmarking.org test page.

LeelaChessZero 0.28 - Backend: BLAS (Nodes Per Second, More Is Better)
AVX-512 On: 9077 (SE +/- 103.04, N = 4; Min: 8776 / Avg: 9076.5 / Max: 9244) -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi
AVX-512 Off: 8423 (SE +/- 21.53, N = 3; Min: 8380 / Avg: 8422.67 / Max: 8449) -mno-avx512f
1. (CXX) g++ options: -flto -O3 -march=native -pthread

LeelaChessZero 0.28 - Backend: Eigen (Nodes Per Second, More Is Better)
AVX-512 On: 9096 (SE +/- 45.37, N = 3; Min: 9020 / Avg: 9096.33 / Max: 9177) -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi
AVX-512 Off: 8162 (SE +/- 43.59, N = 3; Min: 8076 / Avg: 8162.33 / Max: 8216) -mno-avx512f
1. (CXX) g++ options: -flto -O3 -march=native -pthread

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

Embree 3.13 - Binary: Pathtracer ISPC - Model: Asian Dragon
Frames Per Second, More Is Better
AVX-512 On: 212.94 (SE +/- 0.22, N = 9; MIN: 207.72 / MAX: 227.11)
AVX-512 Off: 176.88 (SE +/- 0.21, N = 8; MIN: 170.19 / MAX: 190)
Across runs, AVX-512 On: Min: 211.85 / Avg: 212.94 / Max: 214.13
Across runs, AVX-512 Off: Min: 176.2 / Avg: 176.88 / Max: 178.02

Embree 3.13 - Binary: Pathtracer ISPC - Model: Asian Dragon Obj
Frames Per Second, More Is Better
AVX-512 On: 184.47 (SE +/- 0.62, N = 4; MIN: 178.83 / MAX: 196.34)
AVX-512 Off: 153.45 (SE +/- 0.36, N = 4; MIN: 130.31 / MAX: 166.17)
Across runs, AVX-512 On: Min: 183.33 / Avg: 184.47 / Max: 185.82
Across runs, AVX-512 Off: Min: 152.46 / Avg: 153.45 / Max: 154.16

Embree 3.13 - Binary: Pathtracer ISPC - Model: Crown
Frames Per Second, More Is Better
AVX-512 On: 180.93 (SE +/- 0.62, N = 8; MIN: 124.41 / MAX: 209.74)
AVX-512 Off: 151.06 (SE +/- 0.25, N = 7; MIN: 114.4 / MAX: 176.43)
Across runs, AVX-512 On: Min: 178.03 / Avg: 180.93 / Max: 183.42
Across runs, AVX-512 Off: Min: 150.19 / Avg: 151.06 / Max: 151.76

OpenVKL

OpenVKL is the Intel Open Volume Kernel Library, which offers high-performance volume computation kernels and is part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenVKL 1.3.1 - Benchmark: vklBenchmark ISPC
Items / Sec, More Is Better
AVX-512 On: 1332 (SE +/- 13.58, N = 3; MIN: 329 / MAX: 4770)
AVX-512 Off: 1155 (SE +/- 8.11, N = 3; MIN: 251 / MAX: 5181)
Across runs, AVX-512 On: Min: 1308 / Avg: 1332 / Max: 1355
Across runs, AVX-512 Off: Min: 1140 / Avg: 1154.67 / Max: 1168

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay 2.10 - Benchmark: gravity_spheres_volume/dim_512/ao/real_time
Items Per Second, More Is Better
AVX-512 On: 44.16 (SE +/- 0.21, N = 3)
AVX-512 Off: 26.38 (SE +/- 0.02, N = 3)
Across runs, AVX-512 On: Min: 43.75 / Avg: 44.16 / Max: 44.43
Across runs, AVX-512 Off: Min: 26.34 / Avg: 26.38 / Max: 26.42

OSPRay 2.10 - Benchmark: gravity_spheres_volume/dim_512/scivis/real_time
Items Per Second, More Is Better
AVX-512 On: 43.07 (SE +/- 0.10, N = 3)
AVX-512 Off: 25.14 (SE +/- 0.04, N = 3)
Across runs, AVX-512 On: Min: 42.93 / Avg: 43.07 / Max: 43.27
Across runs, AVX-512 Off: Min: 25.05 / Avg: 25.14 / Max: 25.19

OSPRay 2.10 - Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time
Items Per Second, More Is Better
AVX-512 On: 54.22 (SE +/- 0.05, N = 3)
AVX-512 Off: 40.51 (SE +/- 0.03, N = 3)
Across runs, AVX-512 On: Min: 54.14 / Avg: 54.22 / Max: 54.3
Across runs, AVX-512 Off: Min: 40.45 / Avg: 40.51 / Max: 40.56

OSPRay Studio

Intel OSPRay Studio is an open-source, interactive visualization and ray-tracing software package. OSPRay Studio makes use of Intel OSPRay, a portable ray-tracing engine for high-performance, high-fidelity visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OSPRay Studio 0.11 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 1 - Renderer: Path Tracer
ms, Fewer Is Better
AVX-512 On: 148 (SE +/- 0.33, N = 3)
AVX-512 Off: 181 (SE +/- 0.33, N = 3)
Across runs, AVX-512 On: Min: 148 / Avg: 148.33 / Max: 149
Across runs, AVX-512 Off: Min: 180 / Avg: 180.67 / Max: 181
Extra flags, AVX-512 On: -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi
Extra flags, AVX-512 Off: -mno-avx512f
1. (CXX) g++ options: -O3 -march=native -ldl

OSPRay Studio 0.11 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 16 - Renderer: Path Tracer
ms, Fewer Is Better
AVX-512 On: 2353 (SE +/- 2.33, N = 3)
AVX-512 Off: 2871 (SE +/- 1.20, N = 3)
Across runs, AVX-512 On: Min: 2349 / Avg: 2352.67 / Max: 2357
Across runs, AVX-512 Off: Min: 2869 / Avg: 2870.67 / Max: 2873
Extra flags, AVX-512 On: -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi
Extra flags, AVX-512 Off: -mno-avx512f
1. (CXX) g++ options: -O3 -march=native -ldl

OSPRay Studio 0.11 - Camera: 1 - Resolution: 1080p - Samples Per Pixel: 32 - Renderer: Path Tracer
ms, Fewer Is Better
AVX-512 On: 4698 (SE +/- 3.71, N = 3)
AVX-512 Off: 5749 (SE +/- 6.56, N = 3)
Extra flags, AVX-512 On: -mavx512f -mavx512cd -mavx512vl -mavx512bw -mavx512dq -mavx512ifma -mavx512vbmi
Extra flags, AVX-512 Off: -mno-avx512f
1. (CXX) g++ options: -O3 -march=native -ldl