AMD EPYC Genoa Memory Scaling

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2212240-NE-AMDEPYCGE62&grr&rdt&rro.

AMD EPYC Genoa Memory ScalingProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolution12c10c8c6c2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)AMD Titanite_4G (RTI1002E BIOS)AMD Device 14a41520GB800GB INTEL SSDPF21Q800GBASPEEDVGA HDMIBroadcom NetXtreme BCM5720 PCIeUbuntu 22.106.1.0-phx (x86_64)GNOME Shell 43.0X Server 1.21.1.41.3.224GCC 12.2.0 + Clang 15.0.2-1ext41920x10801264GB1008GB768GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10110dJava Details- OpenJDK Runtime Environment (build 11.0.17+8-post-Ubuntu-1ubuntu2)Python Details- Python 3.10.7Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC Genoa Memory Scalingwrf: conus 2.5kmhpcg: openvkl: vklBenchmark ISPCincompact3d: X3D-benchmarking input.i3dnwchem: C240 Buckyballcockroach: KV, 10% Reads - 512cockroach: KV, 95% Reads - 1024cockroach: KV, 60% Reads - 512cockroach: KV, 50% Reads - 1024ospray: particle_volume/scivis/real_timecockroach: KV, 50% Reads - 512cockroach: KV, 95% Reads - 512ospray: particle_volume/pathtracer/real_timerelion: Basic - CPUtensorflow: CPU - 256 - ResNet-50luxcorerender: Danish Mood - CPUonnx: fcn-resnet101-11 - CPU - Standardluxcorerender: Orange Juice - CPUonednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUopenradioss: Bird Strike on Windshieldcassandra: Writesonednn: Recurrent Neural Network Training - u8s8f32 - CPUospray: particle_volume/ao/real_timecockroach: KV, 60% Reads - 1024onednn: Recurrent Neural Network Inference - u8s8f32 - CPUgraph500: 26build-linux-kernel: allmodconfigbuild-gem5: Time To Compileospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timeospray: gravity_spheres_volume/dim_512/pathtracer/real_timecockroach: KV, 10% Reads - 1024deepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streambuild-nodejs: Time To Compileopenradioss: INIVOL and Fluid Structure Interaction Drop Containeropenvino: Age Gender Recognition Retail 0013 FP16 - CPUopenvino: Age Gender Recognition Retail 0013 FP16 - CPUcockroach: MoVR - 512cockroach: MoVR - 1024openfoam: drivaerFastback, Medium Mesh Size - Execution Timenginx: 500build-linux-kernel: defconfigsimdjson: TopTweetopenradioss: Bumper Beamblender: Barbershop - CPU-Onlydeepsparse: CV Detection,YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection,YOLOv5s COCO - Asynchronous Multi-Streamopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUbuild-llvm: Ninjasimdjson: DistinctUserIDsimdjson: PartialTweetsopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUavifenc: 0namd: ATPase Simulation - 327,506 Atomsopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUstargate: 192000 - 1024openvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16-INT8 - CPUopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamopenvino: Weld Porosity Detection FP16 - CPUopenvino: Weld Porosity Detection FP16 - CPUdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamsimdjson: Kostyadeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdacapobench: H2simdjson: LargeRandbuild2: Time To Compilebuild-php: Time To Compiledeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streambuild-gdb: Time To Compiledeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingnekrs: TurboPipe Periodicstargate: 96000 - 1024build-godot: Time To Compileavifenc: 2gpaw: Carbon Nanotubesvt-av1: Preset 12 - Bosphorus 4Krodinia: OpenMP Streamclustergromacs: MPI CPU - water_GMX50_bareonednn: IP Shapes 3D - bf16bf16bf16 - CPUnpb: IS.Dblender: Classroom - CPU-Onlybuild-apache: Time To Compilebuild-mesa: Time To Compileliquid-dsp: 384 - 256 - 57liquid-dsp: 256 - 256 - 57oidn: RTLightmap.hdr.4096x4096minibude: OpenMP - BM2minibude: OpenMP - BM2onednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUkvazaar: Bosphorus 4K - Very Fastxmrig: Monero - 1Mnpb: CG.Ckvazaar: Bosphorus 4K - Mediumxmrig: Wownero - 1Moidn: RT.hdr_alb_nrm.3840x2160blender: BMW27 - CPU-Onlynpb: MG.Cnpb: SP.Ckvazaar: Bosphorus 4K - Ultra Fastastcenc: Exhaustivebuild-mplayer: Time To Compilenpb: LU.Castcenc: Thoroughdacapobench: Jythonrodinia: OpenMP CFD Solveravifenc: 6, Losslessavifenc: 10, Losslessembree: Pathtracer ISPC - Crownembree: Pathtracer ISPC - Asian Dragonmt-dgemm: Sustained Floating-Point Rateavifenc: 612c10c8c6c4070.1986.81431325125.5262481537.135970.064661.852330.147465.542.797047621.964467.6229.269128.101109.139.6925428.822275.86216.882517931968.7043.706152573.32344.29565152000147.147139.23843.126243.977553.774836846.9125.7196761.4853101.46581.570.55147769.26948.5953.8109.53721201032.0625.5016.5979.8681.03111.8909856.018649.98959.1675.6556.865.651109.4542.981110.6842.95250.34191.43470.98101.7463.2470.127835.309038.479.9519171.510.97119606.212.8290614.3511018.376.487394.651133.282384.35064.859867.411133.477484.24704.11155.4797615.447448021.2549.91744.51980.07901195.910241.70948.76731964.273011814359231768214620000004.34589034.03234.84823.151251.7696.00118.7063.954718491.0120.9220.46120.11710347000000103470000001.65345.6128640.3100.44693073.44104604.680225.0162.56126465.63.528.58209846.76260471.5077.8311.72507.777489164.65106.566333806.0505.2874.241182.4498213.750770.4077332.4594563.18348.29451317146.289830153135993.162029.851748.848449.042.999949102.760769.7230.282151.398105.919.6225528.192325.71218.222436032030.7243.031651959.52438.00574018000145.410134.37343.328743.996954.408335776.8128.9249742.7956101.94181.150.55147717.32949.6949.5117.94003198858.6625.4076.4979.7080.37113.4079844.426851.29934.7175.4406.845.671110.4442.941104.5943.18249.12192.30469.43102.0163.2460.127595.289063.849.9119254.080.98122938.232.8061904.3311066.166.457425.101133.182184.48224.839900.471135.184584.26574.11156.5374611.292648321.2549.80044.60879.71391201.139142.41248.74311965.560611716278934337862580000004.35455633.61634.90923.373241.3696.28518.6774.009387124.9220.7620.48020.20510352666667103400000001.63346.6798666.9800.46345475.35102599.681179.0062.23127226.63.448.42177097.42239496.0177.3011.76377.755489995.20106.854233296.0745.2864.337184.7346214.309370.6137752.4116551.87645.00051325270.0912711519.634832.958195.552515.247498.143.844247596.664111.9228.581221.336105.019.5625729.042371.78219.452408541982.1543.970052559.02375.45531854000147.377136.79343.431044.230254.508736685.7135.6212705.7116101.14981.090.55152292.39960.3946.9166.14971197081.9825.5286.5779.2080.18123.8576773.068654.80875.3975.7256.865.661119.7942.591129.0142.22249.26192.25472.84101.2662.9610.127685.269113.119.9019278.930.98123571.682.8115554.3111108.166.497389.001136.854484.21154.829931.491137.511984.15464.11155.8191614.610547311.2549.87144.58379.68651201.983942.40948.99821954.122711599018794307402470000004.35140233.90534.68724.598227.8986.01818.6783.993056675.7120.6820.58920.10710349666667103376666671.64344.6398615.9670.46579673.04101953.579784.1561.81127081.23.478.34153458.78208535.2376.8411.80907.808466769.54107.110833695.9705.2704.252185.4907217.406071.0103232.4207432.65536.54111212348.8800251517.935742.360137.351275.147593.943.239647428.062666.5230.440258.50095.679.4925328.902471.57219.102468822072.5743.357552626.42479.62392496000145.766134.69543.289944.271654.605436329.6166.4322575.7518102.77680.810.54151213.17954.7952.7227.89595196805.3024.7476.5579.6279.93150.9167635.024658.67817.2776.7476.835.691153.7041.331150.5441.44250.49191.29473.69101.0863.8030.128205.289081.739.8919314.040.97121027.252.8248144.3011150.326.567306.471148.496482.48694.819959.381148.327882.26134.11157.2164608.533648301.2450.08444.69880.43991190.528643.24549.62781930.327711774848249266595543333334.36476733.67134.87426.308221.1616.40917.9403.964885690.0120.7120.72020.15710349000000103403333331.54346.0778651.9240.46505971.41100446.271662.2861.40126057.73.298.33117733.57167474.7075.8611.82077.773454360.62106.509533456.1525.3304.250187.6107221.289870.8983122.435OpenBenchmarking.org

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5km6c8c10c12c160032004800640080007432.666551.884563.184070.191. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.16c8c10c12c20406080100SE +/- 0.99, N = 9SE +/- 0.49, N = 9SE +/- 3.31, N = 9SE +/- 1.12, N = 1236.5445.0048.2986.811. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPC6c8c10c12c30060090012001500SE +/- 15.59, N = 3SE +/- 8.82, N = 3SE +/- 11.03, N = 9SE +/- 6.93, N = 31212132513171325MIN: 328 / MAX: 4115MIN: 330 / MAX: 5664MIN: 327 / MAX: 5660MIN: 329 / MAX: 4553

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d6c8c10c12c80160240320400SE +/- 4.79, N = 9SE +/- 2.69, N = 9SE +/- 0.11, N = 3SE +/- 0.14, N = 3348.88270.09146.29125.531. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyball6c8c10c12c300600900120015001517.91519.61531.01537.11. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

CockroachDB

Workload: KV, 10% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 10% Reads - Concurrency: 5126c8c10c12c8K16K24K32K40KSE +/- 438.30, N = 15SE +/- 351.71, N = 6SE +/- 270.36, N = 15SE +/- 343.66, N = 1535742.334832.935993.135970.0

CockroachDB

Workload: KV, 95% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 95% Reads - Concurrency: 10246c8c10c12c14K28K42K56K70KSE +/- 1310.27, N = 15SE +/- 1317.65, N = 15SE +/- 1142.40, N = 15SE +/- 575.30, N = 360137.358195.562029.864661.8

CockroachDB

Workload: KV, 60% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 60% Reads - Concurrency: 5126c8c10c12c11K22K33K44K55KSE +/- 555.56, N = 15SE +/- 411.73, N = 13SE +/- 620.92, N = 15SE +/- 268.61, N = 351275.152515.251748.852330.1

CockroachDB

Workload: KV, 50% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 50% Reads - Concurrency: 10246c8c10c12c10K20K30K40K50KSE +/- 391.13, N = 9SE +/- 468.66, N = 15SE +/- 380.16, N = 3SE +/- 366.75, N = 1547593.947498.148449.047465.5

OSPRay

Benchmark: particle_volume/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: particle_volume/scivis/real_time6c8c10c12c1020304050SE +/- 0.06, N = 3SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.05, N = 343.2443.8443.0042.80

CockroachDB

Workload: KV, 50% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 50% Reads - Concurrency: 5126c8c10c12c11K22K33K44K55KSE +/- 32.88, N = 3SE +/- 454.84, N = 15SE +/- 514.54, N = 3SE +/- 464.03, N = 1547428.047596.649102.747621.9

CockroachDB

Workload: KV, 95% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 95% Reads - Concurrency: 5126c8c10c12c14K28K42K56K70KSE +/- 813.26, N = 15SE +/- 890.57, N = 3SE +/- 1044.13, N = 15SE +/- 702.29, N = 362666.564111.960769.764467.6

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: particle_volume/pathtracer/real_time6c8c10c12c50100150200250SE +/- 0.59, N = 3SE +/- 1.74, N = 3SE +/- 1.94, N = 3SE +/- 1.54, N = 3230.44228.58230.28229.27

RELION

Test: Basic - Device: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPU6c8c10c12c60120180240300SE +/- 2.59, N = 6SE +/- 2.88, N = 3SE +/- 1.86, N = 4SE +/- 1.38, N = 5258.50221.34151.40128.101. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -lmpi_cxx -lmpi

TensorFlow

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 256 - Model: ResNet-506c8c10c12c20406080100SE +/- 0.26, N = 3SE +/- 0.48, N = 3SE +/- 0.36, N = 3SE +/- 0.48, N = 395.67105.01105.91109.13

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU6c8c10c12c3691215SE +/- 0.14, N = 12SE +/- 0.11, N = 15SE +/- 0.17, N = 12SE +/- 0.09, N = 159.499.569.629.69MIN: 3.85 / MAX: 12.15MIN: 3.94 / MAX: 12.41MIN: 3.97 / MAX: 12.9MIN: 4 / MAX: 12.39

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: Standard6c8c10c12c60120180240300SE +/- 2.17, N = 12SE +/- 2.84, N = 5SE +/- 3.09, N = 3SE +/- 2.33, N = 72532572552541. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto=auto -fno-fat-lto-objects -ldl -lrt

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU6c8c10c12c714212835SE +/- 0.71, N = 15SE +/- 0.72, N = 15SE +/- 0.29, N = 3SE +/- 0.63, N = 1528.9029.0428.1928.82MIN: 22.4 / MAX: 44.91MIN: 22.62 / MAX: 45.48MIN: 23.3 / MAX: 45.65MIN: 23.01 / MAX: 45.86

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU6c8c10c12c5001000150020002500SE +/- 31.16, N = 3SE +/- 25.14, N = 15SE +/- 25.04, N = 15SE +/- 24.22, N = 32471.572371.782325.712275.86MIN: 2410.73MIN: 2234.23MIN: 2171.69MIN: 2213.341. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Bird Strike on Windshield6c8c10c12c50100150200250SE +/- 0.14, N = 3SE +/- 0.19, N = 3SE +/- 0.54, N = 3SE +/- 0.38, N = 3219.10219.45218.22216.88

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.0Test: Writes6c8c10c12c50K100K150K200K250KSE +/- 2957.03, N = 3SE +/- 1899.17, N = 3SE +/- 2429.87, N = 3SE +/- 3742.45, N = 12246882240854243603251793

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU6c8c10c12c400800120016002000SE +/- 16.27, N = 10SE +/- 28.30, N = 3SE +/- 14.89, N = 3SE +/- 31.84, N = 152072.571982.152030.721968.70MIN: 1942.14MIN: 1911.33MIN: 1981.15MIN: 1632.621. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OSPRay

Benchmark: particle_volume/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: particle_volume/ao/real_time6c8c10c12c1020304050SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 343.3643.9743.0343.71

CockroachDB

Workload: KV, 60% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 60% Reads - Concurrency: 10246c8c10c12c11K22K33K44K55KSE +/- 448.33, N = 3SE +/- 447.89, N = 3SE +/- 400.61, N = 10SE +/- 239.52, N = 352626.452559.051959.552573.3

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU6c8c10c12c5001000150020002500SE +/- 25.74, N = 15SE +/- 21.41, N = 3SE +/- 30.76, N = 3SE +/- 21.01, N = 32479.622375.452438.002344.29MIN: 2293.49MIN: 2319.45MIN: 2353.97MIN: 2288.851. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 266c8c10c12c120M240M360M480M600M3924960005318540005740180005651520001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfig6c8c10c12c306090120150SE +/- 0.14, N = 3SE +/- 1.03, N = 3SE +/- 0.72, N = 3SE +/- 0.90, N = 3145.77147.38145.41147.15

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compile6c8c10c12c306090120150SE +/- 0.57, N = 3SE +/- 0.77, N = 3SE +/- 0.36, N = 3SE +/- 0.16, N = 3134.70136.79134.37139.24

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/scivis/real_time6c8c10c12c1020304050SE +/- 0.15, N = 3SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.15, N = 343.2943.4343.3343.13

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/ao/real_time6c8c10c12c1020304050SE +/- 0.07, N = 3SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.13, N = 344.2744.2344.0043.98

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time6c8c10c12c1224364860SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.50, N = 354.6154.5154.4153.77

CockroachDB

Workload: KV, 10% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 10% Reads - Concurrency: 10246c8c10c12c8K16K24K32K40KSE +/- 206.35, N = 3SE +/- 322.68, N = 3SE +/- 346.25, N = 3SE +/- 155.07, N = 336329.636685.735776.836846.9

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream6c8c10c12c4080120160200SE +/- 1.66, N = 15SE +/- 0.38, N = 3SE +/- 0.43, N = 3SE +/- 0.11, N = 3166.43135.62128.92125.72

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream6c8c10c12c160320480640800SE +/- 6.13, N = 15SE +/- 2.11, N = 3SE +/- 2.41, N = 3SE +/- 0.72, N = 3575.75705.71742.80761.49

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 18.8Time To Compile6c8c10c12c20406080100SE +/- 0.06, N = 3SE +/- 0.22, N = 3SE +/- 0.29, N = 3SE +/- 0.26, N = 3102.78101.15101.94101.47

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: INIVOL and Fluid Structure Interaction Drop Container6c8c10c12c20406080100SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 380.8181.0981.1581.57

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU6c8c10c12c0.12380.24760.37140.49520.619SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 10SE +/- 0.00, N = 30.540.550.550.55MIN: 0.5 / MAX: 34.19MIN: 0.5 / MAX: 30.68MIN: 0.5 / MAX: 41.23MIN: 0.5 / MAX: 34.711. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU6c8c10c12c30K60K90K120K150KSE +/- 365.43, N = 3SE +/- 994.61, N = 3SE +/- 1134.97, N = 10SE +/- 745.28, N = 3151213.17152292.39147717.32147769.261. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

CockroachDB

Workload: MoVR - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: MoVR - Concurrency: 5126c8c10c12c2004006008001000SE +/- 4.87, N = 3SE +/- 9.03, N = 3SE +/- 3.66, N = 3SE +/- 3.38, N = 3954.7960.3949.6948.5

CockroachDB

Workload: MoVR - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: MoVR - Concurrency: 10246c8c10c12c2004006008001000SE +/- 1.56, N = 3SE +/- 3.18, N = 3SE +/- 0.58, N = 3SE +/- 1.42, N = 3952.7946.9949.5953.8

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time6c8c10c12c50100150200250227.90166.15117.94109.541. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 5006c8c10c12c40K80K120K160K200KSE +/- 113.87, N = 3SE +/- 453.48, N = 3SE +/- 335.64, N = 3SE +/- 291.63, N = 3196805.30197081.98198858.66201032.061. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfig6c8c10c12c612182430SE +/- 0.22, N = 7SE +/- 0.21, N = 9SE +/- 0.21, N = 14SE +/- 0.19, N = 1124.7525.5325.4125.50

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweet6c8c10c12c246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 6SE +/- 0.01, N = 36.556.576.496.591. (CXX) g++ options: -O3

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Bumper Beam6c8c10c12c20406080100SE +/- 0.71, N = 3SE +/- 0.70, N = 3SE +/- 0.75, N = 3SE +/- 0.79, N = 379.6279.2079.7079.86

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.4Blend File: Barbershop - Compute: CPU-Only6c8c10c12c20406080100SE +/- 0.31, N = 3SE +/- 0.24, N = 3SE +/- 0.15, N = 3SE +/- 0.21, N = 379.9380.1880.3781.03

Neural Magic DeepSparse

Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream6c8c10c12c306090120150SE +/- 1.56, N = 15SE +/- 0.19, N = 3SE +/- 0.08, N = 3SE +/- 0.07, N = 3150.92123.86113.41111.89

Neural Magic DeepSparse

Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream6c8c10c12c2004006008001000SE +/- 6.69, N = 15SE +/- 1.22, N = 3SE +/- 0.53, N = 3SE +/- 0.57, N = 3635.02773.07844.43856.02

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU6c8c10c12c1326395265SE +/- 0.37, N = 3SE +/- 0.57, N = 6SE +/- 0.08, N = 3SE +/- 0.12, N = 358.6754.8051.2949.98MIN: 43.56 / MAX: 315.05MIN: 40.7 / MAX: 276.86MIN: 40.28 / MAX: 292.83MIN: 38.24 / MAX: 187.971. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU6c8c10c12c2004006008001000SE +/- 5.14, N = 3SE +/- 8.79, N = 6SE +/- 1.48, N = 3SE +/- 2.32, N = 3817.27875.39934.71959.161. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninja6c8c10c12c20406080100SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.21, N = 3SE +/- 0.23, N = 376.7575.7375.4475.66

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserID6c8c10c12c246810SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 36.836.866.846.861. (CXX) g++ options: -O3

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweets6c8c10c12c1.28032.56063.84095.12126.4015SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 35.695.665.675.651. (CXX) g++ options: -O3

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU6c8c10c12c2004006008001000SE +/- 4.42, N = 3SE +/- 3.60, N = 3SE +/- 2.71, N = 3SE +/- 3.30, N = 31153.701119.791110.441109.45MIN: 853.88 / MAX: 1939.06MIN: 808.33 / MAX: 1875.91MIN: 769.04 / MAX: 1860.23MIN: 810.74 / MAX: 1835.011. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU6c8c10c12c1020304050SE +/- 0.17, N = 3SE +/- 0.15, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 341.3342.5942.9442.981. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPU6c8c10c12c2004006008001000SE +/- 1.87, N = 3SE +/- 0.54, N = 3SE +/- 5.36, N = 3SE +/- 8.73, N = 31150.541129.011104.591110.68MIN: 870.26 / MAX: 1902.46MIN: 850.94 / MAX: 1870.94MIN: 807.38 / MAX: 1818.79MIN: 833.53 / MAX: 1865.191. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPU6c8c10c12c1020304050SE +/- 0.07, N = 3SE +/- 0.01, N = 3SE +/- 0.20, N = 3SE +/- 0.32, N = 341.4442.2243.1842.951. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU6c8c10c12c50100150200250SE +/- 0.13, N = 3SE +/- 0.69, N = 3SE +/- 0.03, N = 3SE +/- 0.32, N = 3250.49249.26249.12250.34MIN: 213.3 / MAX: 307.84MIN: 207.76 / MAX: 340.53MIN: 209.28 / MAX: 311.3MIN: 222.95 / MAX: 301.421. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU6c8c10c12c4080120160200SE +/- 0.09, N = 3SE +/- 0.48, N = 3SE +/- 0.03, N = 3SE +/- 0.21, N = 3191.29192.25192.30191.431. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPU6c8c10c12c100200300400500SE +/- 0.14, N = 3SE +/- 0.27, N = 3SE +/- 0.10, N = 3SE +/- 0.21, N = 3473.69472.84469.43470.98MIN: 423.34 / MAX: 579.41MIN: 394.37 / MAX: 553.15MIN: 432.92 / MAX: 555.25MIN: 451.07 / MAX: 556.041. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPU6c8c10c12c20406080100SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3101.08101.26102.01101.741. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 06c8c10c12c1428425670SE +/- 0.47, N = 3SE +/- 0.03, N = 3SE +/- 0.27, N = 3SE +/- 0.18, N = 363.8062.9663.2563.251. (CXX) g++ options: -O3 -fPIC -lm

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms6c8c10c12c0.02880.05760.08640.11520.144SE +/- 0.00009, N = 3SE +/- 0.00046, N = 3SE +/- 0.00007, N = 3SE +/- 0.00009, N = 30.128200.127680.127590.12783

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU6c8c10c12c1.19252.3853.57754.775.9625SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 35.285.265.285.30MIN: 4.34 / MAX: 38.93MIN: 4.42 / MAX: 42.93MIN: 4.37 / MAX: 41.23MIN: 4.42 / MAX: 40.661. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU6c8c10c12c2K4K6K8K10KSE +/- 7.67, N = 3SE +/- 2.85, N = 3SE +/- 5.19, N = 3SE +/- 9.96, N = 39081.739113.119063.849038.471. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU6c8c10c12c3691215SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.00, N = 39.899.909.919.95MIN: 8.35 / MAX: 32.16MIN: 8.39 / MAX: 56.99MIN: 8.4 / MAX: 50.42MIN: 8.42 / MAX: 52.381. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU6c8c10c12c4K8K12K16K20KSE +/- 33.95, N = 3SE +/- 31.30, N = 3SE +/- 30.88, N = 3SE +/- 12.43, N = 319314.0419278.9319254.0819171.511. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU6c8c10c12c0.22050.4410.66150.8821.1025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.970.980.980.97MIN: 0.86 / MAX: 33.82MIN: 0.86 / MAX: 39.58MIN: 0.85 / MAX: 39.82MIN: 0.85 / MAX: 22.91. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU6c8c10c12c30K60K90K120K150KSE +/- 681.80, N = 3SE +/- 1158.58, N = 3SE +/- 815.42, N = 3SE +/- 1214.59, N = 3121027.25123571.68122938.23119606.211. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 10246c8c10c12c0.63651.2731.90952.5463.1825SE +/- 0.004057, N = 3SE +/- 0.019484, N = 3SE +/- 0.017291, N = 3SE +/- 0.001919, N = 32.8248142.8115552.8061902.8290611. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU6c8c10c12c0.97881.95762.93643.91524.894SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.304.314.334.35MIN: 3.52 / MAX: 43.57MIN: 3.51 / MAX: 43.89MIN: 3.51 / MAX: 41.25MIN: 3.52 / MAX: 41.441. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU6c8c10c12c2K4K6K8K10KSE +/- 1.79, N = 3SE +/- 1.79, N = 3SE +/- 3.30, N = 3SE +/- 1.42, N = 311150.3211108.1611066.1611018.371. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPU6c8c10c12c246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 36.566.496.456.48MIN: 4.99 / MAX: 59.46MIN: 4.93 / MAX: 59.51MIN: 4.97 / MAX: 59.86MIN: 5.06 / MAX: 59.881. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPU6c8c10c12c16003200480064008000SE +/- 4.59, N = 3SE +/- 6.27, N = 3SE +/- 13.32, N = 3SE +/- 2.30, N = 37306.477389.007425.107394.651. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream6c8c10c12c2004006008001000SE +/- 0.67, N = 3SE +/- 0.88, N = 3SE +/- 0.20, N = 3SE +/- 0.82, N = 31148.501136.851133.181133.28

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream6c8c10c12c20406080100SE +/- 0.31, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.18, N = 382.4984.2184.4884.35

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPU6c8c10c12c1.09132.18263.27394.36525.4565SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.814.824.834.85MIN: 4.14 / MAX: 27.29MIN: 3.98 / MAX: 28.83MIN: 4.08 / MAX: 28.68MIN: 4.06 / MAX: 28.621. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPU6c8c10c12c2K4K6K8K10KSE +/- 3.42, N = 3SE +/- 7.50, N = 3SE +/- 2.08, N = 3SE +/- 2.57, N = 39959.389931.499900.479867.411. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream6c8c10c12c2004006008001000SE +/- 1.05, N = 3SE +/- 1.67, N = 3SE +/- 1.00, N = 3SE +/- 1.25, N = 31148.331137.511135.181133.48

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream6c8c10c12c20406080100SE +/- 0.25, N = 3SE +/- 0.16, N = 3SE +/- 0.03, N = 3SE +/- 0.21, N = 382.2684.1584.2784.25

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: Kostya6c8c10c12c0.92481.84962.77443.69924.624SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 34.114.114.114.111. (CXX) g++ options: -O3

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream6c8c10c12c306090120150SE +/- 0.58, N = 3SE +/- 0.27, N = 3SE +/- 0.55, N = 3SE +/- 0.46, N = 3157.22155.82156.54155.48

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream6c8c10c12c130260390520650SE +/- 2.24, N = 3SE +/- 1.32, N = 3SE +/- 2.48, N = 3SE +/- 1.72, N = 3608.53614.61611.29615.45

DaCapo Benchmark

Java Test: H2

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H26c8c10c12c10002000300040005000SE +/- 36.16, N = 20SE +/- 40.50, N = 20SE +/- 39.79, N = 20SE +/- 53.17, N = 204830473148324802

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandom6c8c10c12c0.28130.56260.84391.12521.4065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.241.251.251.251. (CXX) g++ options: -O3

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile6c8c10c12c1122334455SE +/- 0.28, N = 3SE +/- 0.20, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 350.0849.8749.8049.92

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 8.1.9Time To Compile6c8c10c12c1020304050SE +/- 0.07, N = 3SE +/- 0.04, N = 3SE +/- 0.08, N = 3SE +/- 0.06, N = 344.7044.5844.6144.52

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream6c8c10c12c20406080100SE +/- 0.07, N = 3SE +/- 0.20, N = 3SE +/- 0.03, N = 3SE +/- 0.27, N = 380.4479.6979.7180.08

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream6c8c10c12c30060090012001500SE +/- 1.21, N = 3SE +/- 3.22, N = 3SE +/- 0.69, N = 3SE +/- 4.04, N = 31190.531201.981201.141195.91

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 10.2Time To Compile6c8c10c12c1020304050SE +/- 0.12, N = 3SE +/- 0.03, N = 3SE +/- 0.08, N = 3SE +/- 0.17, N = 343.2542.4142.4141.71

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream6c8c10c12c1122334455SE +/- 0.21, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.12, N = 349.6349.0048.7448.77

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream6c8c10c12c400800120016002000SE +/- 8.40, N = 3SE +/- 1.56, N = 3SE +/- 1.61, N = 3SE +/- 4.95, N = 31930.331954.121965.561964.27

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Rating6c8c10c12c300K600K900K1200K1500KSE +/- 2020.82, N = 3SE +/- 9235.88, N = 3SE +/- 5138.86, N = 3SE +/- 3305.67, N = 311774841159901117162711814351. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Rating6c8c10c12c200K400K600K800K1000KSE +/- 7292.38, N = 3SE +/- 3797.71, N = 3SE +/- 2580.44, N = 3SE +/- 6636.11, N = 38249268794308934339231761. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgFLOP/s, More Is BetternekRS 22.0Input: TurboPipe Periodic6c8c10c12c200000M400000M600000M800000M1000000MSE +/- 1934071468.29, N = 3SE +/- 5892587066.25, N = 3SE +/- 7825985326.68, N = 3SE +/- 9551971733.63, N = 36595543333337402470000007862580000008214620000001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -lmpi_cxx -lmpi

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 96000 - Buffer Size: 10246c8c10c12c0.98211.96422.94633.92844.9105SE +/- 0.002133, N = 3SE +/- 0.008144, N = 3SE +/- 0.010431, N = 3SE +/- 0.023689, N = 34.3647674.3514024.3545564.3458901. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile6c8c10c12c816243240SE +/- 0.11, N = 3SE +/- 0.19, N = 3SE +/- 0.04, N = 3SE +/- 0.40, N = 433.6733.9133.6234.03

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 26c8c10c12c816243240SE +/- 0.14, N = 3SE +/- 0.10, N = 3SE +/- 0.08, N = 3SE +/- 0.14, N = 334.8734.6934.9134.851. (CXX) g++ options: -O3 -fPIC -lm

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon Nanotube6c8c10c12c612182430SE +/- 0.20, N = 3SE +/- 0.18, N = 3SE +/- 0.13, N = 3SE +/- 0.23, N = 526.3124.6023.3723.151. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.4Encoder Mode: Preset 12 - Input: Bosphorus 4K6c8c10c12c60120180240300SE +/- 9.18, N = 13SE +/- 7.53, N = 15SE +/- 7.16, N = 15SE +/- 7.35, N = 15221.16227.90241.37251.77

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamcluster6c8c10c12c246810SE +/- 0.050, N = 3SE +/- 0.078, N = 15SE +/- 0.079, N = 15SE +/- 0.089, N = 156.4096.0186.2856.0011. (CXX) g++ options: -O2 -lOpenCL

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare6c8c10c12c510152025SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 317.9418.6818.6818.711. (CXX) g++ options: -O3

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU6c8c10c12c0.90211.80422.70633.60844.5105SE +/- 0.01788, N = 3SE +/- 0.08932, N = 12SE +/- 0.05885, N = 12SE +/- 0.02537, N = 33.964883.993054.009383.95471MIN: 2.99MIN: 2.67MIN: 2.96MIN: 3.051. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D6c8c10c12c2K4K6K8K10KSE +/- 158.57, N = 12SE +/- 134.50, N = 15SE +/- 206.91, N = 12SE +/- 84.88, N = 35690.016675.717124.928491.011. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.4Blend File: Classroom - Compute: CPU-Only6c8c10c12c510152025SE +/- 0.04, N = 3SE +/- 0.06, N = 3SE +/- 0.09, N = 3SE +/- 0.00, N = 320.7120.6820.7620.92

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To Compile6c8c10c12c510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 320.7220.5920.4820.46

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile6c8c10c12c510152025SE +/- 0.05, N = 3SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 320.1620.1120.2120.12

Liquid-DSP

Threads: 384 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 384 - Buffer Length: 256 - Filter Length: 576c8c10c12c2000M4000M6000M8000M10000MSE +/- 3214550.25, N = 3SE +/- 5783117.19, N = 3SE +/- 4409585.52, N = 3SE +/- 4582575.69, N = 3103490000001034966666710352666667103470000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 576c8c10c12c2000M4000M6000M8000M10000MSE +/- 3844187.53, N = 3SE +/- 4333333.33, N = 3SE +/- 5196152.42, N = 3SE +/- 4618802.15, N = 3103403333331033766666710340000000103470000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x40966c8c10c12c0.37130.74261.11391.48521.8565SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.541.641.631.65

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM26c8c10c12c80160240320400SE +/- 3.87, N = 3SE +/- 2.53, N = 3SE +/- 1.26, N = 3SE +/- 1.09, N = 3346.08344.64346.68345.611. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM26c8c10c12c2K4K6K8K10KSE +/- 96.81, N = 3SE +/- 63.13, N = 3SE +/- 31.49, N = 3SE +/- 27.15, N = 38651.928615.978666.988640.311. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU6c8c10c12c0.10480.20960.31440.41920.524SE +/- 0.005815, N = 3SE +/- 0.006374, N = 3SE +/- 0.005241, N = 4SE +/- 0.005042, N = 30.4650590.4657960.4634540.446930MIN: 0.38MIN: 0.38MIN: 0.38MIN: 0.381. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very Fast6c8c10c12c20406080100SE +/- 0.77, N = 3SE +/- 1.04, N = 3SE +/- 0.74, N = 3SE +/- 0.58, N = 1071.4173.0475.3573.441. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1M6c8c10c12c20K40K60K80K100KSE +/- 214.10, N = 3SE +/- 383.60, N = 3SE +/- 152.19, N = 3SE +/- 328.13, N = 3100446.2101953.5102599.6104604.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C6c8c10c12c20K40K60K80K100KSE +/- 554.69, N = 3SE +/- 907.72, N = 15SE +/- 899.80, N = 15SE +/- 812.04, N = 1571662.2879784.1581179.0080225.011. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Medium6c8c10c12c1428425670SE +/- 0.53, N = 3SE +/- 0.73, N = 3SE +/- 0.11, N = 3SE +/- 0.68, N = 361.4061.8162.2362.561. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Wownero - Hash Count: 1M6c8c10c12c30K60K90K120K150KSE +/- 349.73, N = 3SE +/- 122.05, N = 3SE +/- 70.55, N = 3SE +/- 849.90, N = 3126057.7127081.2127226.6126465.61. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x21606c8c10c12c0.7921.5842.3763.1683.96SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 33.293.473.443.52

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.4Blend File: BMW27 - Compute: CPU-Only6c8c10c12c246810SE +/- 0.06, N = 3SE +/- 0.02, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 38.338.348.428.58

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C6c8c10c12c40K80K120K160K200KSE +/- 1626.80, N = 15SE +/- 2089.98, N = 15SE +/- 2631.10, N = 15SE +/- 2393.90, N = 3117733.57153458.78177097.42209846.761. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C6c8c10c12c60K120K180K240K300KSE +/- 1838.44, N = 3SE +/- 1630.30, N = 3SE +/- 726.36, N = 3SE +/- 1589.72, N = 3167474.70208535.23239496.01260471.501. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Ultra Fast6c8c10c12c20406080100SE +/- 0.63, N = 3SE +/- 0.71, N = 3SE +/- 1.02, N = 3SE +/- 0.66, N = 375.8676.8477.3077.831. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Exhaustive6c8c10c12c3691215SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.03, N = 311.8211.8111.7611.731. (CXX) g++ options: -O3 -flto -pthread

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.5Time To Compile6c8c10c12c246810SE +/- 0.010, N = 3SE +/- 0.023, N = 3SE +/- 0.034, N = 3SE +/- 0.033, N = 37.7737.8087.7557.777

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C6c8c10c12c100K200K300K400K500KSE +/- 4680.97, N = 5SE +/- 5095.33, N = 5SE +/- 2546.14, N = 3SE +/- 5489.08, N = 4454360.62466769.54489995.20489164.651. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Thorough6c8c10c12c20406080100SE +/- 0.10, N = 3SE +/- 0.04, N = 3SE +/- 0.05, N = 3SE +/- 0.05, N = 3106.51107.11106.85106.571. (CXX) g++ options: -O3 -flto -pthread

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython6c8c10c12c7001400210028003500SE +/- 21.34, N = 4SE +/- 35.24, N = 4SE +/- 18.49, N = 4SE +/- 29.26, N = 43345336933293380

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solver6c8c10c12c246810SE +/- 0.024, N = 3SE +/- 0.016, N = 3SE +/- 0.014, N = 3SE +/- 0.031, N = 36.1525.9706.0746.0501. (CXX) g++ options: -O2 -lOpenCL

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, Lossless6c8c10c12c1.19932.39863.59794.79725.9965SE +/- 0.055, N = 3SE +/- 0.034, N = 3SE +/- 0.044, N = 3SE +/- 0.076, N = 35.3305.2705.2865.2871. (CXX) g++ options: -O3 -fPIC -lm

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 10, Lossless6c8c10c12c0.97581.95162.92743.90324.879SE +/- 0.043, N = 3SE +/- 0.009, N = 3SE +/- 0.055, N = 3SE +/- 0.024, N = 34.2504.2524.3374.2411. (CXX) g++ options: -O3 -fPIC -lm

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Crown6c8c10c12c4080120160200SE +/- 0.33, N = 3SE +/- 0.36, N = 3SE +/- 0.47, N = 3SE +/- 1.01, N = 3187.61185.49184.73182.45MIN: 146.69 / MAX: 208.25MIN: 134.45 / MAX: 211.64MIN: 137.82 / MAX: 210.21MIN: 128.42 / MAX: 209.42

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian Dragon6c8c10c12c50100150200250SE +/- 0.46, N = 3SE +/- 0.39, N = 3SE +/- 0.47, N = 3SE +/- 0.13, N = 3221.29217.41214.31213.75MIN: 215.19 / MAX: 233.21MIN: 211.73 / MAX: 230.1MIN: 209.11 / MAX: 223.97MIN: 209.16 / MAX: 225.43

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate6c8c10c12c1632486480SE +/- 0.13, N = 3SE +/- 0.05, N = 3SE +/- 0.02, N = 3SE +/- 0.33, N = 370.9071.0170.6170.411. (CC) gcc options: -O3 -march=native -fopenmp

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 66c8c10c12c0.55331.10661.65992.21322.7665SE +/- 0.004, N = 3SE +/- 0.017, N = 3SE +/- 0.003, N = 3SE +/- 0.016, N = 32.4352.4202.4112.4591. (CXX) g++ options: -O3 -fPIC -lm


Phoronix Test Suite v10.8.5