AMD EPYC Genoa Memory Scaling

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2212240-NE-AMDEPYCGE62&grs.

System details (configurations: 12c, 10c, 8c, 6c)

Processor: 2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)
Motherboard: AMD Titanite_4G (RTI1002E BIOS)
Chipset: AMD Device 14a4
Memory: 1520GB (12c), 1264GB (10c), 1008GB (8c), 768GB (6c)
Disk: 800GB INTEL SSDPF21Q800GB
Graphics: ASPEED
Monitor: VGA HDMI
Network: Broadcom NetXtreme BCM5720 PCIe
OS: Ubuntu 22.10
Kernel: 6.1.0-phx (x86_64)
Desktop: GNOME Shell 43.0
Display Server: X Server 1.21.1.4
Vulkan: 1.3.224
Compiler: GCC 12.2.0 + Clang 15.0.2-1
File-System: ext4
Screen Resolution: 1920x1080

Kernel Details: Transparent Huge Pages: madvise

Compiler Details: --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10110d

Java Details: OpenJDK Runtime Environment (build 11.0.17+8-post-Ubuntu-1ubuntu2)

Python Details: Python 3.10.7

Security Details: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Results overview (12c, 10c, 8c, 6c): the individual test results are listed below.

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

Xcompact3d Incompact3d 2021-03-11 - Seconds, Fewer Is Better
12c: 125.53 (SE +/- 0.14, N = 3)
10c: 146.29 (SE +/- 0.11, N = 3)
8c: 270.09 (SE +/- 2.69, N = 9)
6c: 348.88 (SE +/- 4.79, N = 9)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
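
For time-based results like this one, it can help to express each configuration as a multiple of the 12c run time. The following is a minimal sketch using only the four Xcompact3d values reported above; the function name `slowdown_vs_baseline` is hypothetical and not part of any benchmarking tool.

```python
# Run times in seconds from the Xcompact3d result above (fewer is better).
times = {"12c": 125.53, "10c": 146.29, "8c": 270.09, "6c": 348.88}


def slowdown_vs_baseline(results, baseline="12c"):
    """Return each configuration's run time as a multiple of the baseline.

    Hypothetical helper for illustration only.
    """
    base = results[baseline]
    return {name: value / base for name, value in results.items()}


slowdowns = slowdown_vs_baseline(times)
for name, factor in slowdowns.items():
    print(f"{name}: {factor:.2f}x the 12c run time")
```

The same helper applies to any of the "Fewer Is Better" results in this file by swapping in that test's four values.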

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenFOAM 10 - Seconds, Fewer Is Better
12c: 109.54
10c: 117.94
8c: 166.15
6c: 227.90
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

RELION

Test: Basic - Device: CPU

RELION 3.1.1 - Seconds, Fewer Is Better
12c: 128.10 (SE +/- 1.38, N = 5)
10c: 151.40 (SE +/- 1.86, N = 4)
8c: 221.34 (SE +/- 2.88, N = 3)
6c: 258.50 (SE +/- 2.59, N = 6)
1. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -lmpi_cxx -lmpi

WRF

Input: conus 2.5km

WRF 4.2.2 - Seconds, Fewer Is Better
12c: 4070.19
10c: 4563.18
8c: 6551.88
6c: 7432.66
1. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: MG.C

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
12c: 209846.76 (SE +/- 2393.90, N = 3)
10c: 177097.42 (SE +/- 2631.10, N = 15)
8c: 153458.78 (SE +/- 2089.98, N = 15)
6c: 117733.57 (SE +/- 1626.80, N = 15)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.4
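
For throughput results, one way to read the scaling is to compare the fraction of 12c throughput retained against the fraction of memory channels retained. The sketch below uses the MG.C values above and assumes that the labels 12c, 10c, 8c and 6c denote the number of populated memory channels per socket; that reading of the labels is an assumption, and `retained_fraction` is a hypothetical helper.

```python
# MG.C throughput in Total Mop/s (more is better), keyed by the number in
# each configuration label. Treating that number as memory channels per
# socket is an assumption, not something the result file states.
mops = {12: 209846.76, 10: 177097.42, 8: 153458.78, 6: 117733.57}


def retained_fraction(results, full=12):
    """Map channels -> (throughput fraction, channel fraction) vs. `full`."""
    base = results[full]
    return {ch: (value / base, ch / full) for ch, value in results.items()}


fractions = retained_fraction(mops)
for ch, (throughput, channels) in sorted(fractions.items(), reverse=True):
    print(f"{ch}c: {throughput:.1%} of 12c throughput, "
          f"{channels:.1%} of the channels")
```

A throughput fraction close to the channel fraction suggests the test tracks memory bandwidth closely under that assumption; a fraction near 100% suggests it does not.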

NAS Parallel Benchmarks

Test / Class: SP.C

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
12c: 260471.50 (SE +/- 1589.72, N = 3)
10c: 239496.01 (SE +/- 726.36, N = 3)
8c: 208535.23 (SE +/- 1630.30, N = 3)
6c: 167474.70 (SE +/- 1838.44, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.4

Graph500

Scale: 26

Graph500 3.0 - sssp median_TEPS, More Is Better
12c: 565152000
10c: 574018000
8c: 531854000
6c: 392496000
1. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Neural Magic DeepSparse

Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 - ms/batch, Fewer Is Better
12c: 111.89 (SE +/- 0.07, N = 3)
10c: 113.41 (SE +/- 0.08, N = 3)
8c: 123.86 (SE +/- 0.19, N = 3)
6c: 150.92 (SE +/- 1.56, N = 15)

Neural Magic DeepSparse

Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 - items/sec, More Is Better
12c: 856.02 (SE +/- 0.57, N = 3)
10c: 844.43 (SE +/- 0.53, N = 3)
8c: 773.07 (SE +/- 1.22, N = 3)
6c: 635.02 (SE +/- 6.69, N = 15)

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 - ms/batch, Fewer Is Better
12c: 125.72 (SE +/- 0.11, N = 3)
10c: 128.92 (SE +/- 0.43, N = 3)
8c: 135.62 (SE +/- 0.38, N = 3)
6c: 166.43 (SE +/- 1.66, N = 15)

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 - items/sec, More Is Better
12c: 761.49 (SE +/- 0.72, N = 3)
10c: 742.80 (SE +/- 2.41, N = 3)
8c: 705.71 (SE +/- 2.11, N = 3)
6c: 575.75 (SE +/- 6.13, N = 15)

nekRS

Input: TurboPipe Periodic

nekRS 22.0 - FLOP/s, More Is Better
12c: 821462000000 (SE +/- 9551971733.63, N = 3)
10c: 786258000000 (SE +/- 7825985326.68, N = 3)
8c: 740247000000 (SE +/- 5892587066.25, N = 3)
6c: 659554333333 (SE +/- 1934071468.29, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -lmpi_cxx -lmpi

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2022.3 - ms, Fewer Is Better
12c: 49.98 (SE +/- 0.12, N = 3; MIN: 38.24 / MAX: 187.97)
10c: 51.29 (SE +/- 0.08, N = 3; MIN: 40.28 / MAX: 292.83)
8c: 54.80 (SE +/- 0.57, N = 6; MIN: 40.7 / MAX: 276.86)
6c: 58.67 (SE +/- 0.37, N = 3; MIN: 43.56 / MAX: 315.05)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenVINO 2022.3 - FPS, More Is Better
12c: 959.16 (SE +/- 2.32, N = 3)
10c: 934.71 (SE +/- 1.48, N = 3)
8c: 875.39 (SE +/- 8.79, N = 6)
6c: 817.27 (SE +/- 5.14, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

TensorFlow

Device: CPU - Batch Size: 256 - Model: ResNet-50

TensorFlow 2.10 - images/sec, More Is Better
12c: 109.13 (SE +/- 0.48, N = 3)
10c: 105.91 (SE +/- 0.36, N = 3)
8c: 105.01 (SE +/- 0.48, N = 3)
6c: 95.67 (SE +/- 0.26, N = 3)

GPAW

Input: Carbon Nanotube

GPAW 22.1 - Seconds, Fewer Is Better
12c: 23.15 (SE +/- 0.23, N = 5)
10c: 23.37 (SE +/- 0.13, N = 3)
8c: 24.60 (SE +/- 0.18, N = 3)
6c: 26.31 (SE +/- 0.20, N = 3)
1. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
12c: 80225.01 (SE +/- 812.04, N = 15)
10c: 81179.00 (SE +/- 899.80, N = 15)
8c: 79784.15 (SE +/- 907.72, N = 15)
6c: 71662.28 (SE +/- 554.69, N = 3)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.4

7-Zip Compression

Test: Compression Rating

7-Zip Compression 22.01 - MIPS, More Is Better
12c: 923176 (SE +/- 6636.11, N = 3)
10c: 893433 (SE +/- 2580.44, N = 3)
8c: 879430 (SE +/- 3797.71, N = 3)
6c: 824926 (SE +/- 7292.38, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVKL

Benchmark: vklBenchmark ISPC

OpenVKL 1.3.1 - Items / Sec, More Is Better
12c: 1325 (SE +/- 6.93, N = 3; MIN: 329 / MAX: 4553)
10c: 1317 (SE +/- 11.03, N = 9; MIN: 327 / MAX: 5660)
8c: 1325 (SE +/- 8.82, N = 3; MIN: 330 / MAX: 5664)
6c: 1212 (SE +/- 15.59, N = 3; MIN: 328 / MAX: 4115)

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 - ms, Fewer Is Better
12c: 2275.86 (SE +/- 24.22, N = 3; MIN: 2213.34)
10c: 2325.71 (SE +/- 25.04, N = 15; MIN: 2171.69)
8c: 2371.78 (SE +/- 25.14, N = 15; MIN: 2234.23)
6c: 2471.57 (SE +/- 31.16, N = 3; MIN: 2410.73)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

NAS Parallel Benchmarks

Test / Class: LU.C

NAS Parallel Benchmarks 3.4 - Total Mop/s, More Is Better
12c: 489164.65 (SE +/- 5489.08, N = 4)
10c: 489995.20 (SE +/- 2546.14, N = 3)
8c: 466769.54 (SE +/- 5095.33, N = 5)
6c: 454360.62 (SE +/- 4680.97, N = 5)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.4

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

Intel Open Image Denoise 1.4.0 - Images / Sec, More Is Better
12c: 1.65 (SE +/- 0.00, N = 3)
10c: 1.63 (SE +/- 0.00, N = 3)
8c: 1.64 (SE +/- 0.01, N = 3)
6c: 1.54 (SE +/- 0.01, N = 3)

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

Intel Open Image Denoise 1.4.0 - Images / Sec, More Is Better
12c: 3.52 (SE +/- 0.02, N = 3)
10c: 3.44 (SE +/- 0.02, N = 3)
8c: 3.47 (SE +/- 0.02, N = 3)
6c: 3.29 (SE +/- 0.02, N = 3)

Rodinia

Test: OpenMP Streamcluster

Rodinia 3.1 - Seconds, Fewer Is Better
12c: 6.001 (SE +/- 0.089, N = 15)
10c: 6.285 (SE +/- 0.079, N = 15)
8c: 6.018 (SE +/- 0.078, N = 15)
6c: 6.409 (SE +/- 0.050, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.0 - ms, Fewer Is Better
12c: 2344.29 (SE +/- 21.01, N = 3; MIN: 2288.85)
10c: 2438.00 (SE +/- 30.76, N = 3; MIN: 2353.97)
8c: 2375.45 (SE +/- 21.41, N = 3; MIN: 2319.45)
6c: 2479.62 (SE +/- 25.74, N = 15; MIN: 2293.49)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

Kvazaar 2.1 - Frames Per Second, More Is Better
12c: 73.44 (SE +/- 0.58, N = 10)
10c: 75.35 (SE +/- 0.74, N = 3)
8c: 73.04 (SE +/- 1.04, N = 3)
6c: 71.41 (SE +/- 0.77, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Apache Cassandra

Test: Writes

Apache Cassandra 4.0 - Op/s, More Is Better
12c: 251793 (SE +/- 3742.45, N = 12)
10c: 243603 (SE +/- 2429.87, N = 3)
8c: 240854 (SE +/- 1899.17, N = 3)
6c: 246882 (SE +/- 2957.03, N = 3)

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2022.1 - Ns Per Day, More Is Better
12c: 18.71 (SE +/- 0.03, N = 3)
10c: 18.68 (SE +/- 0.01, N = 3)
8c: 18.68 (SE +/- 0.01, N = 3)
6c: 17.94 (SE +/- 0.03, N = 3)
1. (CXX) g++ options: -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.0 - ms, Fewer Is Better
12c: 0.446930 (SE +/- 0.005042, N = 3; MIN: 0.38)
10c: 0.463454 (SE +/- 0.005241, N = 4; MIN: 0.38)
8c: 0.465796 (SE +/- 0.006374, N = 3; MIN: 0.38)
6c: 0.465059 (SE +/- 0.005815, N = 3; MIN: 0.38)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2022.3 - FPS, More Is Better
12c: 42.95 (SE +/- 0.32, N = 3)
10c: 43.18 (SE +/- 0.20, N = 3)
8c: 42.22 (SE +/- 0.01, N = 3)
6c: 41.44 (SE +/- 0.07, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenVINO 2022.3 - ms, Fewer Is Better
12c: 1110.68 (SE +/- 8.73, N = 3; MIN: 833.53 / MAX: 1865.19)
10c: 1104.59 (SE +/- 5.36, N = 3; MIN: 807.38 / MAX: 1818.79)
8c: 1129.01 (SE +/- 0.54, N = 3; MIN: 850.94 / MAX: 1870.94)
6c: 1150.54 (SE +/- 1.87, N = 3; MIN: 870.26 / MAX: 1902.46)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Xmrig

Variant: Monero - Hash Count: 1M

Xmrig 6.18.1 - H/s, More Is Better
12c: 104604.6 (SE +/- 328.13, N = 3)
10c: 102599.6 (SE +/- 152.19, N = 3)
8c: 101953.5 (SE +/- 383.60, N = 3)
6c: 100446.2 (SE +/- 214.10, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2022.3 - FPS, More Is Better
12c: 42.98 (SE +/- 0.13, N = 3)
10c: 42.94 (SE +/- 0.12, N = 3)
8c: 42.59 (SE +/- 0.15, N = 3)
6c: 41.33 (SE +/- 0.17, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenVINO 2022.3 - ms, Fewer Is Better
12c: 1109.45 (SE +/- 3.30, N = 3; MIN: 810.74 / MAX: 1835.01)
10c: 1110.44 (SE +/- 2.71, N = 3; MIN: 769.04 / MAX: 1860.23)
8c: 1119.79 (SE +/- 3.60, N = 3; MIN: 808.33 / MAX: 1875.91)
6c: 1153.70 (SE +/- 4.42, N = 3; MIN: 853.88 / MAX: 1939.06)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed GDB GNU Debugger Compilation

Time To Compile

Timed GDB GNU Debugger Compilation 10.2 - Seconds, Fewer Is Better
12c: 41.71 (SE +/- 0.17, N = 3)
10c: 42.41 (SE +/- 0.08, N = 3)
8c: 42.41 (SE +/- 0.03, N = 3)
6c: 43.25 (SE +/- 0.12, N = 3)

Timed Gem5 Compilation

Time To Compile

Timed Gem5 Compilation 21.2 - Seconds, Fewer Is Better
12c: 139.24 (SE +/- 0.16, N = 3)
10c: 134.37 (SE +/- 0.36, N = 3)
8c: 136.79 (SE +/- 0.77, N = 3)
6c: 134.70 (SE +/- 0.57, N = 3)

CockroachDB

Workload: KV, 50% Reads - Concurrency: 512

CockroachDB 22.2 - ops/s, More Is Better
12c: 47621.9 (SE +/- 464.03, N = 15)
10c: 49102.7 (SE +/- 514.54, N = 3)
8c: 47596.6 (SE +/- 454.84, N = 15)
6c: 47428.0 (SE +/- 32.88, N = 3)
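
When the differences between configurations are small, as in this result, a rough way to judge whether a gap is distinguishable from run-to-run noise is to compare it against the combined standard error of the two results. The sketch below assumes that "SE +/-" denotes the standard error of the mean and that the two runs are independent; it is a heuristic, not a formal significance test, and `gap_in_standard_errors` is a hypothetical helper.

```python
import math

# (mean ops/s, reported SE) for the CockroachDB KV 50% reads, 512 result above.
# Assumption: "SE +/-" is the standard error of the mean.
results = {
    "12c": (47621.9, 464.03),
    "10c": (49102.7, 514.54),
    "8c": (47596.6, 454.84),
    "6c": (47428.0, 32.88),
}


def gap_in_standard_errors(a, b):
    """Absolute difference of two means in units of their combined SE.

    Hypothetical helper: combines the two standard errors in quadrature,
    which assumes the two measurements are independent.
    """
    (mean_a, se_a), (mean_b, se_b) = a, b
    return abs(mean_a - mean_b) / math.hypot(se_a, se_b)


gap = gap_in_standard_errors(results["12c"], results["6c"])
print(f"12c vs 6c: {gap:.2f} combined standard errors apart")
```

A value well below 1, as for 12c versus 6c here, indicates that the two means are closer together than the reported measurement uncertainty.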

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Embree 3.13 - Frames Per Second, More Is Better
12c: 213.75 (SE +/- 0.13, N = 3; MIN: 209.16 / MAX: 225.43)
10c: 214.31 (SE +/- 0.47, N = 3; MIN: 209.11 / MAX: 223.97)
8c: 217.41 (SE +/- 0.39, N = 3; MIN: 211.73 / MAX: 230.1)
6c: 221.29 (SE +/- 0.46, N = 3; MIN: 215.19 / MAX: 233.21)

CockroachDB

Workload: KV, 10% Reads - Concurrency: 512

CockroachDB 22.2 - ops/s, More Is Better
12c: 35970.0 (SE +/- 343.66, N = 15)
10c: 35993.1 (SE +/- 270.36, N = 15)
8c: 34832.9 (SE +/- 351.71, N = 6)
6c: 35742.3 (SE +/- 438.30, N = 15)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2022.3 - FPS, More Is Better
12c: 119606.21 (SE +/- 1214.59, N = 3)
10c: 122938.23 (SE +/- 815.42, N = 3)
8c: 123571.68 (SE +/- 1158.58, N = 3)
6c: 121027.25 (SE +/- 681.80, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Linux Kernel Compilation

Build: defconfig

Timed Linux Kernel Compilation 6.1 - Seconds, Fewer Is Better
12c: 25.50 (SE +/- 0.19, N = 11)
10c: 25.41 (SE +/- 0.21, N = 14)
8c: 25.53 (SE +/- 0.21, N = 9)
6c: 24.75 (SE +/- 0.22, N = 7)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2022.3 - FPS, More Is Better
12c: 147769.26 (SE +/- 745.28, N = 3)
10c: 147717.32 (SE +/- 1134.97, N = 10)
8c: 152292.39 (SE +/- 994.61, N = 3)
6c: 151213.17 (SE +/- 365.43, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Rodinia

Test: OpenMP CFD Solver

Rodinia 3.1 - Seconds, Fewer Is Better
12c: 6.050 (SE +/- 0.031, N = 3)
10c: 6.074 (SE +/- 0.014, N = 3)
8c: 5.970 (SE +/- 0.016, N = 3)
6c: 6.152 (SE +/- 0.024, N = 3)
1. (CXX) g++ options: -O2 -lOpenCL

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 3.4 - Seconds, Fewer Is Better
12c: 8.58 (SE +/- 0.06, N = 3)
10c: 8.42 (SE +/- 0.04, N = 3)
8c: 8.34 (SE +/- 0.02, N = 3)
6c: 8.33 (SE +/- 0.06, N = 3)

CockroachDB

Workload: KV, 10% Reads - Concurrency: 1024

CockroachDB 22.2 - ops/s, More Is Better
12c: 36846.9 (SE +/- 155.07, N = 3)
10c: 35776.8 (SE +/- 346.25, N = 3)
8c: 36685.7 (SE +/- 322.68, N = 3)
6c: 36329.6 (SE +/- 206.35, N = 3)

Embree

Binary: Pathtracer ISPC - Model: Crown

Embree 3.13 - Frames Per Second, More Is Better
12c: 182.45 (SE +/- 1.01, N = 3; MIN: 128.42 / MAX: 209.42)
10c: 184.73 (SE +/- 0.47, N = 3; MIN: 137.82 / MAX: 210.21)
8c: 185.49 (SE +/- 0.36, N = 3; MIN: 134.45 / MAX: 211.64)
6c: 187.61 (SE +/- 0.33, N = 3; MIN: 146.69 / MAX: 208.25)

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Kvazaar 2.1 - Frames Per Second, More Is Better
12c: 77.83 (SE +/- 0.66, N = 3)
10c: 77.30 (SE +/- 1.02, N = 3)
8c: 76.84 (SE +/- 0.71, N = 3)
6c: 75.86 (SE +/- 0.63, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OSPRay

Benchmark: particle_volume/scivis/real_time

OSPRay 2.10 - Items Per Second, More Is Better
12c: 42.80 (SE +/- 0.05, N = 3)
10c: 43.00 (SE +/- 0.01, N = 3)
8c: 43.84 (SE +/- 0.03, N = 3)
6c: 43.24 (SE +/- 0.06, N = 3)

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 - items/sec, More Is Better
12c: 84.25 (SE +/- 0.21, N = 3)
10c: 84.27 (SE +/- 0.03, N = 3)
8c: 84.15 (SE +/- 0.16, N = 3)
6c: 82.26 (SE +/- 0.25, N = 3)

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 - items/sec, More Is Better
12c: 84.35 (SE +/- 0.18, N = 3)
10c: 84.48 (SE +/- 0.04, N = 3)
8c: 84.21 (SE +/- 0.04, N = 3)
6c: 82.49 (SE +/- 0.31, N = 3)

CockroachDB

Workload: KV, 60% Reads - Concurrency: 512

CockroachDB 22.2 - ops/s, More Is Better
12c: 52330.1 (SE +/- 268.61, N = 3)
10c: 51748.8 (SE +/- 620.92, N = 15)
8c: 52515.2 (SE +/- 411.73, N = 13)
6c: 51275.1 (SE +/- 555.56, N = 15)

libavif avifenc

Encoder Speed: 10, Lossless

libavif avifenc 0.11 - Seconds, Fewer Is Better
12c: 4.241 (SE +/- 0.024, N = 3)
10c: 4.337 (SE +/- 0.055, N = 3)
8c: 4.252 (SE +/- 0.009, N = 3)
6c: 4.250 (SE +/- 0.043, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

OSPRay

Benchmark: particle_volume/ao/real_time

OSPRay 2.10 - Items Per Second, More Is Better
12c: 43.71 (SE +/- 0.04, N = 3)
10c: 43.03 (SE +/- 0.04, N = 3)
8c: 43.97 (SE +/- 0.01, N = 3)
6c: 43.36 (SE +/- 0.04, N = 3)

nginx

Connections: 500

nginx 1.23.2 - Requests Per Second, More Is Better
12c: 201032.06 (SE +/- 291.63, N = 3)
10c: 198858.66 (SE +/- 335.64, N = 3)
8c: 197081.98 (SE +/- 453.48, N = 3)
6c: 196805.30 (SE +/- 113.87, N = 3)
1. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

DaCapo Benchmark

Java Test: H2

DaCapo Benchmark 9.12-MR1 - msec, Fewer Is Better
12c: 4802 (SE +/- 53.17, N = 20)
10c: 4832 (SE +/- 39.79, N = 20)
8c: 4731 (SE +/- 40.50, N = 20)
6c: 4830 (SE +/- 36.16, N = 20)

CockroachDB

Workload: KV, 50% Reads - Concurrency: 1024

CockroachDB 22.2 - ops/s, More Is Better
12c: 47465.5 (SE +/- 366.75, N = 15)
10c: 48449.0 (SE +/- 380.16, N = 3)
8c: 47498.1 (SE +/- 468.66, N = 15)
6c: 47593.9 (SE +/- 391.13, N = 9)

libavif avifenc

Encoder Speed: 6

libavif avifenc 0.11 - Seconds, Fewer Is Better
12c: 2.459 (SE +/- 0.016, N = 3)
10c: 2.411 (SE +/- 0.003, N = 3)
8c: 2.420 (SE +/- 0.017, N = 3)
6c: 2.435 (SE +/- 0.004, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

Kvazaar 2.1 (Frames Per Second, More Is Better)
12c: 62.56 (SE +/- 0.68, N = 3)
10c: 62.23 (SE +/- 0.11, N = 3)
8c: 61.81 (SE +/- 0.73, N = 3)
6c: 61.40 (SE +/- 0.53, N = 3)
1. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

7-Zip Compression

Test: Decompression Rating

7-Zip Compression 22.01 (MIPS, More Is Better)
12c: 1181435 (SE +/- 3305.67, N = 3)
10c: 1171627 (SE +/- 5138.86, N = 3)
8c: 1159901 (SE +/- 9235.88, N = 3)
6c: 1177484 (SE +/- 2020.82, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 0.55 (SE +/- 0.00, N = 3; MIN: 0.5 / MAX: 34.71)
10c: 0.55 (SE +/- 0.00, N = 10; MIN: 0.5 / MAX: 41.23)
8c: 0.55 (SE +/- 0.00, N = 3; MIN: 0.5 / MAX: 30.68)
6c: 0.54 (SE +/- 0.00, N = 3; MIN: 0.5 / MAX: 34.19)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (items/sec, More Is Better)
12c: 1964.27 (SE +/- 4.95, N = 3)
10c: 1965.56 (SE +/- 1.61, N = 3)
8c: 1954.12 (SE +/- 1.56, N = 3)
6c: 1930.33 (SE +/- 8.40, N = 3)

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (ms/batch, Fewer Is Better)
12c: 48.77 (SE +/- 0.12, N = 3)
10c: 48.74 (SE +/- 0.04, N = 3)
8c: 49.00 (SE +/- 0.04, N = 3)
6c: 49.63 (SE +/- 0.21, N = 3)

Timed LLVM Compilation

Build System: Ninja

Timed LLVM Compilation 13.0 (Seconds, Fewer Is Better)
12c: 75.66 (SE +/- 0.23, N = 3)
10c: 75.44 (SE +/- 0.21, N = 3)
8c: 75.73 (SE +/- 0.09, N = 3)
6c: 76.75 (SE +/- 0.06, N = 3)

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 6.48 (SE +/- 0.00, N = 3; MIN: 5.06 / MAX: 59.88)
10c: 6.45 (SE +/- 0.01, N = 3; MIN: 4.97 / MAX: 59.86)
8c: 6.49 (SE +/- 0.01, N = 3; MIN: 4.93 / MAX: 59.51)
6c: 6.56 (SE +/- 0.00, N = 3; MIN: 4.99 / MAX: 59.46)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 7394.65 (SE +/- 2.30, N = 3)
10c: 7425.10 (SE +/- 13.32, N = 3)
8c: 7389.00 (SE +/- 6.27, N = 3)
6c: 7306.47 (SE +/- 4.59, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Node.js Compilation

Time To Compile

Timed Node.js Compilation 18.8 (Seconds, Fewer Is Better)
12c: 101.47 (SE +/- 0.26, N = 3)
10c: 101.94 (SE +/- 0.29, N = 3)
8c: 101.15 (SE +/- 0.22, N = 3)
6c: 102.78 (SE +/- 0.06, N = 3)

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime 1.11 (Inferences Per Minute, More Is Better)
12c: 254 (SE +/- 2.33, N = 7)
10c: 255 (SE +/- 3.09, N = 3)
8c: 257 (SE +/- 2.84, N = 5)
6c: 253 (SE +/- 2.17, N = 12)
1. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto=auto -fno-fat-lto-objects -ldl -lrt

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OSPRay 2.10 (Items Per Second, More Is Better)
12c: 53.77 (SE +/- 0.50, N = 3)
10c: 54.41 (SE +/- 0.12, N = 3)
8c: 54.51 (SE +/- 0.08, N = 3)
6c: 54.61 (SE +/- 0.04, N = 3)

simdjson

Throughput Test: TopTweet

simdjson 2.0 (GB/s, More Is Better)
12c: 6.59 (SE +/- 0.01, N = 3)
10c: 6.49 (SE +/- 0.07, N = 6)
8c: 6.57 (SE +/- 0.01, N = 3)
6c: 6.55 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3

DaCapo Benchmark

Java Test: Jython

DaCapo Benchmark 9.12-MR1 (msec, Fewer Is Better)
12c: 3380 (SE +/- 29.26, N = 4)
10c: 3329 (SE +/- 18.49, N = 4)
8c: 3369 (SE +/- 35.24, N = 4)
6c: 3345 (SE +/- 21.34, N = 4)

Blender

Blend File: Barbershop - Compute: CPU-Only

Blender 3.4 (Seconds, Fewer Is Better)
12c: 81.03 (SE +/- 0.21, N = 3)
10c: 80.37 (SE +/- 0.15, N = 3)
8c: 80.18 (SE +/- 0.24, N = 3)
6c: 79.93 (SE +/- 0.31, N = 3)

Timed Linux Kernel Compilation

Build: allmodconfig

Timed Linux Kernel Compilation 6.1 (Seconds, Fewer Is Better)
12c: 147.15 (SE +/- 0.90, N = 3)
10c: 145.41 (SE +/- 0.72, N = 3)
8c: 147.38 (SE +/- 1.03, N = 3)
6c: 145.77 (SE +/- 0.14, N = 3)

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (ms/batch, Fewer Is Better)
12c: 1133.28 (SE +/- 0.82, N = 3)
10c: 1133.18 (SE +/- 0.20, N = 3)
8c: 1136.85 (SE +/- 0.88, N = 3)
6c: 1148.50 (SE +/- 0.67, N = 3)

libavif avifenc

Encoder Speed: 0

libavif avifenc 0.11 (Seconds, Fewer Is Better)
12c: 63.25 (SE +/- 0.18, N = 3)
10c: 63.25 (SE +/- 0.27, N = 3)
8c: 62.96 (SE +/- 0.03, N = 3)
6c: 63.80 (SE +/- 0.47, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (ms/batch, Fewer Is Better)
12c: 1133.48 (SE +/- 1.25, N = 3)
10c: 1135.18 (SE +/- 1.00, N = 3)
8c: 1137.51 (SE +/- 1.67, N = 3)
6c: 1148.33 (SE +/- 1.05, N = 3)

CockroachDB

Workload: KV, 60% Reads - Concurrency: 1024

CockroachDB 22.2 (ops/s, More Is Better)
12c: 52573.3 (SE +/- 239.52, N = 3)
10c: 51959.5 (SE +/- 400.61, N = 10)
8c: 52559.0 (SE +/- 447.89, N = 3)
6c: 52626.4 (SE +/- 448.33, N = 3)

Timed Apache Compilation

Time To Compile

Timed Apache Compilation 2.4.41 (Seconds, Fewer Is Better)
12c: 20.46 (SE +/- 0.01, N = 3)
10c: 20.48 (SE +/- 0.01, N = 3)
8c: 20.59 (SE +/- 0.01, N = 3)
6c: 20.72 (SE +/- 0.01, N = 3)

NWChem

Input: C240 Buckyball

NWChem 7.0.2 (Seconds, Fewer Is Better)
12c: 1537.1
10c: 1531.0
8c: 1519.6
6c: 1517.9
1. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

CockroachDB

Workload: MoVR - Concurrency: 512

CockroachDB 22.2 (ops/s, More Is Better)
12c: 948.5 (SE +/- 3.38, N = 3)
10c: 949.6 (SE +/- 3.66, N = 3)
8c: 960.3 (SE +/- 9.03, N = 3)
6c: 954.7 (SE +/- 4.87, N = 3)

Timed Godot Game Engine Compilation

Time To Compile

Timed Godot Game Engine Compilation 3.2.3 (Seconds, Fewer Is Better)
12c: 34.03 (SE +/- 0.40, N = 4)
10c: 33.62 (SE +/- 0.04, N = 3)
8c: 33.91 (SE +/- 0.19, N = 3)
6c: 33.67 (SE +/- 0.11, N = 3)

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 11018.37 (SE +/- 1.42, N = 3)
10c: 11066.16 (SE +/- 3.30, N = 3)
8c: 11108.16 (SE +/- 1.79, N = 3)
6c: 11150.32 (SE +/- 1.79, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenRadioss

Model: Bird Strike on Windshield

OpenRadioss 2022.10.13 (Seconds, Fewer Is Better)
12c: 216.88 (SE +/- 0.38, N = 3)
10c: 218.22 (SE +/- 0.54, N = 3)
8c: 219.45 (SE +/- 0.19, N = 3)
6c: 219.10 (SE +/- 0.14, N = 3)

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 4.35 (SE +/- 0.00, N = 3; MIN: 3.52 / MAX: 41.44)
10c: 4.33 (SE +/- 0.00, N = 3; MIN: 3.51 / MAX: 41.25)
8c: 4.31 (SE +/- 0.00, N = 3; MIN: 3.51 / MAX: 43.89)
6c: 4.30 (SE +/- 0.00, N = 3; MIN: 3.52 / MAX: 43.57)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Blender

Blend File: Classroom - Compute: CPU-Only

Blender 3.4 (Seconds, Fewer Is Better)
12c: 20.92 (SE +/- 0.00, N = 3)
10c: 20.76 (SE +/- 0.09, N = 3)
8c: 20.68 (SE +/- 0.06, N = 3)
6c: 20.71 (SE +/- 0.04, N = 3)

libavif avifenc

Encoder Speed: 6, Lossless

libavif avifenc 0.11 (Seconds, Fewer Is Better)
12c: 5.287 (SE +/- 0.076, N = 3)
10c: 5.286 (SE +/- 0.044, N = 3)
8c: 5.270 (SE +/- 0.034, N = 3)
6c: 5.330 (SE +/- 0.055, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (items/sec, More Is Better)
12c: 615.45 (SE +/- 1.72, N = 3)
10c: 611.29 (SE +/- 2.48, N = 3)
8c: 614.61 (SE +/- 1.32, N = 3)
6c: 608.53 (SE +/- 2.24, N = 3)

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (ms/batch, Fewer Is Better)
12c: 155.48 (SE +/- 0.46, N = 3)
10c: 156.54 (SE +/- 0.55, N = 3)
8c: 155.82 (SE +/- 0.27, N = 3)
6c: 157.22 (SE +/- 0.58, N = 3)

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 0.97 (SE +/- 0.00, N = 3; MIN: 0.85 / MAX: 22.9)
10c: 0.98 (SE +/- 0.00, N = 3; MIN: 0.85 / MAX: 39.82)
8c: 0.98 (SE +/- 0.00, N = 3; MIN: 0.86 / MAX: 39.58)
6c: 0.97 (SE +/- 0.00, N = 3; MIN: 0.86 / MAX: 33.82)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (items/sec, More Is Better)
12c: 1195.91 (SE +/- 4.04, N = 3)
10c: 1201.14 (SE +/- 0.69, N = 3)
8c: 1201.98 (SE +/- 3.22, N = 3)
6c: 1190.53 (SE +/- 1.21, N = 3)

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.1 (ms/batch, Fewer Is Better)
12c: 80.08 (SE +/- 0.27, N = 3)
10c: 79.71 (SE +/- 0.03, N = 3)
8c: 79.69 (SE +/- 0.20, N = 3)
6c: 80.44 (SE +/- 0.07, N = 3)

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenRadioss 2022.10.13 (Seconds, Fewer Is Better)
12c: 81.57 (SE +/- 0.14, N = 3)
10c: 81.15 (SE +/- 0.08, N = 3)
8c: 81.09 (SE +/- 0.12, N = 3)
6c: 80.81 (SE +/- 0.08, N = 3)

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 9867.41 (SE +/- 2.57, N = 3)
10c: 9900.47 (SE +/- 2.08, N = 3)
8c: 9931.49 (SE +/- 7.50, N = 3)
6c: 9959.38 (SE +/- 3.42, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Xmrig

Variant: Wownero - Hash Count: 1M

Xmrig 6.18.1 (H/s, More Is Better)
12c: 126465.6 (SE +/- 849.90, N = 3)
10c: 127226.6 (SE +/- 70.55, N = 3)
8c: 127081.2 (SE +/- 122.05, N = 3)
6c: 126057.7 (SE +/- 349.73, N = 3)
1. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 101.74 (SE +/- 0.08, N = 3)
10c: 102.01 (SE +/- 0.03, N = 3)
8c: 101.26 (SE +/- 0.04, N = 3)
6c: 101.08 (SE +/- 0.06, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 470.98 (SE +/- 0.21, N = 3; MIN: 451.07 / MAX: 556.04)
10c: 469.43 (SE +/- 0.10, N = 3; MIN: 432.92 / MAX: 555.25)
8c: 472.84 (SE +/- 0.27, N = 3; MIN: 394.37 / MAX: 553.15)
6c: 473.69 (SE +/- 0.14, N = 3; MIN: 423.34 / MAX: 579.41)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

ACES DGEMM

Sustained Floating-Point Rate

ACES DGEMM 1.0 (GFLOP/s, More Is Better)
12c: 70.41 (SE +/- 0.33, N = 3)
10c: 70.61 (SE +/- 0.02, N = 3)
8c: 71.01 (SE +/- 0.05, N = 3)
6c: 70.90 (SE +/- 0.13, N = 3)
1. (CC) gcc options: -O3 -march=native -fopenmp

OpenRadioss

Model: Bumper Beam

OpenRadioss 2022.10.13 (Seconds, Fewer Is Better)
12c: 79.86 (SE +/- 0.79, N = 3)
10c: 79.70 (SE +/- 0.75, N = 3)
8c: 79.20 (SE +/- 0.70, N = 3)
6c: 79.62 (SE +/- 0.71, N = 3)

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 4.85 (SE +/- 0.00, N = 3; MIN: 4.06 / MAX: 28.62)
10c: 4.83 (SE +/- 0.00, N = 3; MIN: 4.08 / MAX: 28.68)
8c: 4.82 (SE +/- 0.00, N = 3; MIN: 3.98 / MAX: 28.83)
6c: 4.81 (SE +/- 0.00, N = 3; MIN: 4.14 / MAX: 27.29)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 9038.47 (SE +/- 9.96, N = 3)
10c: 9063.84 (SE +/- 5.19, N = 3)
8c: 9113.11 (SE +/- 2.85, N = 3)
6c: 9081.73 (SE +/- 7.67, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

ASTC Encoder

Preset: Exhaustive

ASTC Encoder 4.0 (MT/s, More Is Better)
12c: 11.73 (SE +/- 0.03, N = 3)
10c: 11.76 (SE +/- 0.00, N = 3)
8c: 11.81 (SE +/- 0.01, N = 3)
6c: 11.82 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5 (Render Ratio, More Is Better)
12c: 2.829061 (SE +/- 0.001919, N = 3)
10c: 2.806190 (SE +/- 0.017291, N = 3)
8c: 2.811555 (SE +/- 0.019484, N = 3)
6c: 2.824814 (SE +/- 0.004057, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OSPRay 2.10 (Items Per Second, More Is Better)
12c: 229.27 (SE +/- 1.54, N = 3)
10c: 230.28 (SE +/- 1.94, N = 3)
8c: 228.58 (SE +/- 1.74, N = 3)
6c: 230.44 (SE +/- 0.59, N = 3)

simdjson

Throughput Test: LargeRandom

simdjson 2.0 (GB/s, More Is Better)
12c: 1.25 (SE +/- 0.00, N = 3)
10c: 1.25 (SE +/- 0.00, N = 3)
8c: 1.25 (SE +/- 0.00, N = 3)
6c: 1.24 (SE +/- 0.00, N = 3)
1. (CXX) g++ options: -O3

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 5.30 (SE +/- 0.01, N = 3; MIN: 4.42 / MAX: 40.66)
10c: 5.28 (SE +/- 0.00, N = 3; MIN: 4.37 / MAX: 41.23)
8c: 5.26 (SE +/- 0.00, N = 3; MIN: 4.42 / MAX: 42.93)
6c: 5.28 (SE +/- 0.00, N = 3; MIN: 4.34 / MAX: 38.93)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 19171.51 (SE +/- 12.43, N = 3)
10c: 19254.08 (SE +/- 30.88, N = 3)
8c: 19278.93 (SE +/- 31.30, N = 3)
6c: 19314.04 (SE +/- 33.95, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

CockroachDB

Workload: MoVR - Concurrency: 1024

CockroachDB 22.2 (ops/s, More Is Better)
12c: 953.8 (SE +/- 1.42, N = 3)
10c: 949.5 (SE +/- 0.58, N = 3)
8c: 946.9 (SE +/- 3.18, N = 3)
6c: 952.7 (SE +/- 1.56, N = 3)

simdjson

Throughput Test: PartialTweets

simdjson 2.0 (GB/s, More Is Better)
12c: 5.65 (SE +/- 0.01, N = 3)
10c: 5.67 (SE +/- 0.02, N = 3)
8c: 5.66 (SE +/- 0.01, N = 3)
6c: 5.69 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OSPRay 2.10 (Items Per Second, More Is Better)
12c: 43.13 (SE +/- 0.15, N = 3)
10c: 43.33 (SE +/- 0.12, N = 3)
8c: 43.43 (SE +/- 0.13, N = 3)
6c: 43.29 (SE +/- 0.15, N = 3)

Timed MPlayer Compilation

Time To Compile

Timed MPlayer Compilation 1.5 (Seconds, Fewer Is Better)
12c: 7.777 (SE +/- 0.033, N = 3)
10c: 7.755 (SE +/- 0.034, N = 3)
8c: 7.808 (SE +/- 0.023, N = 3)
6c: 7.773 (SE +/- 0.010, N = 3)

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OSPRay 2.10 (Items Per Second, More Is Better)
12c: 43.98 (SE +/- 0.13, N = 3)
10c: 44.00 (SE +/- 0.04, N = 3)
8c: 44.23 (SE +/- 0.10, N = 3)
6c: 44.27 (SE +/- 0.07, N = 3)

libavif avifenc

Encoder Speed: 2

libavif avifenc 0.11 (Seconds, Fewer Is Better)
12c: 34.85 (SE +/- 0.14, N = 3)
10c: 34.91 (SE +/- 0.08, N = 3)
8c: 34.69 (SE +/- 0.10, N = 3)
6c: 34.87 (SE +/- 0.14, N = 3)
1. (CXX) g++ options: -O3 -fPIC -lm

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 9.95 (SE +/- 0.00, N = 3; MIN: 8.42 / MAX: 52.38)
10c: 9.91 (SE +/- 0.02, N = 3; MIN: 8.4 / MAX: 50.42)
8c: 9.90 (SE +/- 0.02, N = 3; MIN: 8.39 / MAX: 56.99)
6c: 9.89 (SE +/- 0.02, N = 3; MIN: 8.35 / MAX: 32.16)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (GFInst/s, More Is Better)
12c: 8640.31 (SE +/- 27.15, N = 3)
10c: 8666.98 (SE +/- 31.49, N = 3)
8c: 8615.97 (SE +/- 63.13, N = 3)
6c: 8651.92 (SE +/- 96.81, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

miniBUDE 20210901 (Billion Interactions/s, More Is Better)
12c: 345.61 (SE +/- 1.09, N = 3)
10c: 346.68 (SE +/- 1.26, N = 3)
8c: 344.64 (SE +/- 2.53, N = 3)
6c: 346.08 (SE +/- 3.87, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Build2

Time To Compile

Build2 0.13 (Seconds, Fewer Is Better)
12c: 49.92 (SE +/- 0.04, N = 3)
10c: 49.80 (SE +/- 0.02, N = 3)
8c: 49.87 (SE +/- 0.20, N = 3)
6c: 50.08 (SE +/- 0.28, N = 3)

ASTC Encoder

Preset: Thorough

ASTC Encoder 4.0 (MT/s, More Is Better)
12c: 106.57 (SE +/- 0.05, N = 3)
10c: 106.85 (SE +/- 0.05, N = 3)
8c: 107.11 (SE +/- 0.04, N = 3)
6c: 106.51 (SE +/- 0.10, N = 3)
1. (CXX) g++ options: -O3 -flto -pthread

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (ms, Fewer Is Better)
12c: 250.34 (SE +/- 0.32, N = 3; MIN: 222.95 / MAX: 301.42)
10c: 249.12 (SE +/- 0.03, N = 3; MIN: 209.28 / MAX: 311.3)
8c: 249.26 (SE +/- 0.69, N = 3; MIN: 207.76 / MAX: 340.53)
6c: 250.49 (SE +/- 0.13, N = 3; MIN: 213.3 / MAX: 307.84)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenVINO 2022.3 (FPS, More Is Better)
12c: 191.43 (SE +/- 0.21, N = 3)
10c: 192.30 (SE +/- 0.03, N = 3)
8c: 192.25 (SE +/- 0.48, N = 3)
6c: 191.29 (SE +/- 0.09, N = 3)
1. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Mesa Compilation

Time To Compile

Timed Mesa Compilation 21.0 (Seconds, Fewer Is Better)
12c: 20.12 (SE +/- 0.07, N = 3)
10c: 20.21 (SE +/- 0.06, N = 3)
8c: 20.11 (SE +/- 0.07, N = 3)
6c: 20.16 (SE +/- 0.05, N = 3)

NAMD

ATPase Simulation - 327,506 Atoms

NAMD 2.14 (days/ns, Fewer Is Better)
12c: 0.12783 (SE +/- 0.00009, N = 3)
10c: 0.12759 (SE +/- 0.00007, N = 3)
8c: 0.12768 (SE +/- 0.00046, N = 3)
6c: 0.12820 (SE +/- 0.00009, N = 3)

simdjson

Throughput Test: DistinctUserID

simdjson 2.0 (GB/s, More Is Better)
12c: 6.86 (SE +/- 0.02, N = 3)
10c: 6.84 (SE +/- 0.02, N = 3)
8c: 6.86 (SE +/- 0.01, N = 3)
6c: 6.83 (SE +/- 0.02, N = 3)
1. (CXX) g++ options: -O3

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

Stargate Digital Audio Workstation 22.11.5 (Render Ratio, More Is Better)
12c: 4.345890 (SE +/- 0.023689, N = 3)
10c: 4.354556 (SE +/- 0.010431, N = 3)
8c: 4.351402 (SE +/- 0.008144, N = 3)
6c: 4.364767 (SE +/- 0.002133, N = 3)
1. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Timed PHP Compilation

Time To Compile

Timed PHP Compilation 8.1.9 (Seconds, Fewer Is Better)
12c: 44.52 (SE +/- 0.06, N = 3)
10c: 44.61 (SE +/- 0.08, N = 3)
8c: 44.58 (SE +/- 0.04, N = 3)
6c: 44.70 (SE +/- 0.07, N = 3)

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
12c: 10347000000 (SE +/- 4618802.15, N = 3)
10c: 10340000000 (SE +/- 5196152.42, N = 3)
8c: 10337666667 (SE +/- 4333333.33, N = 3)
6c: 10340333333 (SE +/- 3844187.53, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 384 - Buffer Length: 256 - Filter Length: 57

Liquid-DSP 2021.01.31 (samples/s, More Is Better)
12c: 10347000000 (SE +/- 4582575.69, N = 3)
10c: 10352666667 (SE +/- 4409585.52, N = 3)
8c: 10349666667 (SE +/- 5783117.19, N = 3)
6c: 10349000000 (SE +/- 3214550.25, N = 3)
1. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

simdjson

Throughput Test: Kostya

simdjson 2.0 (GB/s, More Is Better)
12c: 4.11 (SE +/- 0.01, N = 3)
10c: 4.11 (SE +/- 0.01, N = 3)
8c: 4.11 (SE +/- 0.00, N = 3)
6c: 4.11 (SE +/- 0.01, N = 3)
1. (CXX) g++ options: -O3

CockroachDB

Workload: KV, 95% Reads - Concurrency: 1024

CockroachDB 22.2 (ops/s, More Is Better)
12c: 64661.8 (SE +/- 575.30, N = 3)
10c: 62029.8 (SE +/- 1142.40, N = 15)
8c: 58195.5 (SE +/- 1317.65, N = 15)
6c: 60137.3 (SE +/- 1310.27, N = 15)

CockroachDB

Workload: KV, 95% Reads - Concurrency: 512

CockroachDB 22.2 (ops/s, More Is Better)
12c: 64467.6 (SE +/- 702.29, N = 3)
10c: 60769.7 (SE +/- 1044.13, N = 15)
8c: 64111.9 (SE +/- 890.57, N = 3)
6c: 62666.5 (SE +/- 813.26, N = 15)

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
12c: 1968.70 (SE +/- 31.84, N = 15; MIN: 1632.62)
10c: 2030.72 (SE +/- 14.89, N = 3; MIN: 1981.15)
8c: 1982.15 (SE +/- 28.30, N = 3; MIN: 1911.33)
6c: 2072.57 (SE +/- 16.27, N = 10; MIN: 1942.14)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
12c: 3.95471 (SE +/- 0.02537, N = 3; MIN: 3.05)
10c: 4.00938 (SE +/- 0.05885, N = 12; MIN: 2.96)
8c: 3.99305 (SE +/- 0.08932, N = 12; MIN: 2.67)
6c: 3.96488 (SE +/- 0.01788, N = 3; MIN: 2.99)
1. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

SVT-AV1 1.4 (Frames Per Second, More Is Better)
12c: 251.77 (SE +/- 7.35, N = 15)
10c: 241.37 (SE +/- 7.16, N = 15)
8c: 227.90 (SE +/- 7.53, N = 15)
6c: 221.16 (SE +/- 9.18, N = 13)

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
12c: 28.82 (SE +/- 0.63, N = 15; MIN: 23.01 / MAX: 45.86)
10c: 28.19 (SE +/- 0.29, N = 3; MIN: 23.3 / MAX: 45.65)
8c: 29.04 (SE +/- 0.72, N = 15; MIN: 22.62 / MAX: 45.48)
6c: 28.90 (SE +/- 0.71, N = 15; MIN: 22.4 / MAX: 44.91)

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

LuxCoreRender 2.6 (M samples/sec, More Is Better)
12c: 9.69 (SE +/- 0.09, N = 15; MIN: 4 / MAX: 12.39)
10c: 9.62 (SE +/- 0.17, N = 12; MIN: 3.97 / MAX: 12.9)
8c: 9.56 (SE +/- 0.11, N = 15; MIN: 3.94 / MAX: 12.41)
6c: 9.49 (SE +/- 0.14, N = 12; MIN: 3.85 / MAX: 12.15)

NAS Parallel Benchmarks

Test / Class: IS.D

NAS Parallel Benchmarks 3.4 (Total Mop/s, More Is Better)
12c: 8491.01 (SE +/- 84.88, N = 3)
10c: 7124.92 (SE +/- 206.91, N = 12)
8c: 6675.71 (SE +/- 134.50, N = 15)
6c: 5690.01 (SE +/- 158.57, N = 12)
1. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz
2. Open MPI 4.1.4

High Performance Conjugate Gradient

High Performance Conjugate Gradient 3.1 (GFLOP/s, More Is Better)
12c: 86.81 (SE +/- 1.12, N = 12)
10c: 48.29 (SE +/- 3.31, N = 9)
8c: 45.00 (SE +/- 0.49, N = 9)
6c: 36.54 (SE +/- 0.99, N = 9)
1. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi


Phoronix Test Suite v10.8.4