AMD EPYC Genoa Memory Scaling

Benchmarks by Michael Larabel for a future article.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2212240-NE-AMDEPYCGE62
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results

Limit displaying results to tests within:

AV1 2 Tests
BLAS (Basic Linear Algebra Sub-Routine) Tests 3 Tests
C++ Boost Tests 2 Tests
Timed Code Compilation 11 Tests
C/C++ Compiler Tests 10 Tests
CPU Massive 15 Tests
Creator Workloads 14 Tests
Database Test Suite 2 Tests
Encoding 4 Tests
Fortran Tests 5 Tests
Game Development 5 Tests
HPC - High Performance Computing 21 Tests
Java Tests 2 Tests
LAPACK (Linear Algebra Pack) Tests 2 Tests
Machine Learning 5 Tests
Molecular Dynamics 5 Tests
MPI Benchmarks 5 Tests
Multi-Core 30 Tests
NVIDIA GPU Compute 4 Tests
Intel oneAPI 6 Tests
OpenMPI Tests 13 Tests
Programmer / Developer System Benchmarks 13 Tests
Python Tests 11 Tests
Renderers 3 Tests
Scientific Computing 7 Tests
Server 4 Tests
Server CPU Tests 11 Tests
Video Encoding 3 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
12c
December 21 2022
  11 Hours, 55 Minutes
10c
December 21 2022
  12 Hours, 59 Minutes
8c
December 22 2022
  13 Hours, 22 Minutes
6c
December 23 2022
  15 Hours, 14 Minutes
Invert Hiding All Results Option
  13 Hours, 22 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


AMD EPYC Genoa Memory Scaling - Phoronix Test Suite

AMD EPYC Genoa Memory Scaling

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2212240-NE-AMDEPYCGE62&rdt&grs.

AMD EPYC Genoa Memory ScalingProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelDesktopDisplay ServerVulkanCompilerFile-SystemScreen Resolution12c10c8c6c2 x AMD EPYC 9654 96-Core @ 2.40GHz (192 Cores / 384 Threads)AMD Titanite_4G (RTI1002E BIOS)AMD Device 14a41520GB800GB INTEL SSDPF21Q800GBASPEEDVGA HDMIBroadcom NetXtreme BCM5720 PCIeUbuntu 22.106.1.0-phx (x86_64)GNOME Shell 43.0X Server 1.21.1.41.3.224GCC 12.2.0 + Clang 15.0.2-1ext41920x10801264GB1008GB768GBOpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v Processor Details- Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10110dJava Details- OpenJDK Runtime Environment (build 11.0.17+8-post-Ubuntu-1ubuntu2)Python Details- Python 3.10.7Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

AMD EPYC Genoa Memory Scalingincompact3d: X3D-benchmarking input.i3dopenfoam: drivaerFastback, Medium Mesh Size - Execution Timerelion: Basic - CPUwrf: conus 2.5kmnpb: MG.Cnpb: SP.Cgraph500: 26deepsparse: CV Detection,YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection,YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamnekrs: TurboPipe Periodicopenvino: Machine Translation EN To DE FP16 - CPUopenvino: Machine Translation EN To DE FP16 - CPUtensorflow: CPU - 256 - ResNet-50gpaw: Carbon Nanotubenpb: CG.Ccompress-7zip: Compression Ratingopenvkl: vklBenchmark ISPConednn: Recurrent Neural Network Inference - bf16bf16bf16 - CPUnpb: LU.Coidn: RTLightmap.hdr.4096x4096oidn: RT.hdr_alb_nrm.3840x2160rodinia: OpenMP Streamclusteronednn: Recurrent Neural Network Inference - u8s8f32 - CPUkvazaar: Bosphorus 4K - Very Fastcassandra: Writesgromacs: MPI CPU - water_GMX50_bareonednn: Matrix Multiply Batch Shapes Transformer - u8s8f32 - CPUopenvino: Person Detection FP32 - CPUopenvino: Person Detection FP32 - CPUxmrig: Monero - 1Mopenvino: Person Detection FP16 - CPUopenvino: Person Detection FP16 - CPUbuild-gdb: Time To Compilebuild-gem5: Time To Compilecockroach: KV, 50% Reads - 512embree: Pathtracer ISPC - Asian Dragoncockroach: KV, 10% Reads - 512openvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUbuild-linux-kernel: defconfigopenvino: Age Gender Recognition Retail 0013 FP16 - CPUrodinia: OpenMP CFD Solverblender: BMW27 - CPU-Onlycockroach: KV, 10% Reads - 1024embree: Pathtracer ISPC - Crownkvazaar: Bosphorus 4K - Ultra Fastospray: particle_volume/scivis/real_timedeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamcockroach: KV, 60% Reads - 512avifenc: 10, Losslessospray: particle_volume/ao/real_timenginx: 500dacapobench: H2cockroach: KV, 50% Reads - 1024avifenc: 6kvazaar: Bosphorus 4K - Mediumcompress-7zip: Decompression Ratingopenvino: Age Gender Recognition Retail 0013 FP16 - CPUdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streambuild-llvm: Ninjaopenvino: Vehicle Detection FP16 - CPUopenvino: Vehicle Detection FP16 - CPUbuild-nodejs: Time To Compileonnx: fcn-resnet101-11 - CPU - Standardospray: gravity_spheres_volume/dim_512/pathtracer/real_timesimdjson: TopTweetdacapobench: Jythonblender: Barbershop - CPU-Onlybuild-linux-kernel: allmodconfigdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamavifenc: 0deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamcockroach: KV, 60% Reads - 1024build-apache: Time To Compilenwchem: C240 Buckyballcockroach: MoVR - 512build-godot: Time To Compileopenvino: Vehicle Detection FP16-INT8 - CPUopenradioss: Bird Strike on Windshieldopenvino: Vehicle Detection FP16-INT8 - CPUblender: Classroom - CPU-Onlyavifenc: 6, Losslessdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamopenvino: Age Gender Recognition Retail 0013 FP16-INT8 - CPUdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamopenradioss: INIVOL and Fluid Structure Interaction Drop Containeropenvino: Weld Porosity Detection FP16 - CPUxmrig: Wownero - 1Mopenvino: Face Detection FP16 - CPUopenvino: Face Detection FP16 - CPUmt-dgemm: Sustained Floating-Point Rateopenradioss: Bumper Beamopenvino: Weld Porosity Detection FP16 - CPUopenvino: Person Vehicle Bike Detection FP16 - CPUastcenc: Exhaustivestargate: 192000 - 1024ospray: particle_volume/pathtracer/real_timesimdjson: LargeRandopenvino: Person Vehicle Bike Detection FP16 - CPUopenvino: Weld Porosity Detection FP16-INT8 - CPUcockroach: MoVR - 1024simdjson: PartialTweetsospray: gravity_spheres_volume/dim_512/scivis/real_timebuild-mplayer: Time To Compileospray: gravity_spheres_volume/dim_512/ao/real_timeavifenc: 2openvino: Weld Porosity Detection FP16-INT8 - CPUminibude: OpenMP - BM2minibude: OpenMP - BM2build2: Time To Compileastcenc: Thoroughopenvino: Face Detection FP16-INT8 - CPUopenvino: Face Detection FP16-INT8 - CPUbuild-mesa: Time To Compilenamd: ATPase Simulation - 327,506 Atomssimdjson: DistinctUserIDstargate: 96000 - 1024build-php: Time To Compileliquid-dsp: 256 - 256 - 57liquid-dsp: 384 - 256 - 57simdjson: Kostyacockroach: KV, 95% Reads - 1024cockroach: KV, 95% Reads - 512onednn: Recurrent Neural Network Training - u8s8f32 - CPUonednn: IP Shapes 3D - bf16bf16bf16 - CPUsvt-av1: Preset 12 - Bosphorus 4Kluxcorerender: Orange Juice - CPUluxcorerender: Danish Mood - CPUnpb: IS.Dhpcg: 12c10c8c6c125.526248109.53721128.1014070.19209846.76260471.50565152000111.8909856.0186125.7196761.485382146200000049.98959.16109.1323.15180225.0192317613252275.86489164.651.653.526.0012344.2973.4425179318.7060.44693042.951110.68104604.642.981109.4541.709139.23847621.9213.750735970.0119606.2125.501147769.266.0508.5836846.9182.449877.8342.797084.247084.350652330.14.24143.7061201032.06480247465.52.45962.5611814350.551964.273048.767375.6556.487394.65101.46525453.77486.59338081.03147.1471133.282363.2471133.477452573.320.4611537.1948.534.03211018.37216.884.3520.925.287615.4474155.47970.971195.910280.079081.579867.41126465.6101.74470.9870.40773379.864.859038.4711.72502.829061229.2691.255.3019171.51953.85.6543.12627.77743.977534.8489.958640.310345.61249.917106.5663250.34191.4320.1170.127836.864.34589044.51910347000000103470000004.1164661.864467.61968.703.95471251.76928.829.698491.0186.8143146.289830117.94003151.3984563.183177097.42239496.01574018000113.4079844.4268128.9249742.795678625800000051.29934.71105.9123.37381179.0089343313172325.71489995.201.633.446.2852438.0075.3524360318.6770.46345443.181104.59102599.642.941110.4442.412134.37349102.7214.309335993.1122938.2325.407147717.326.0748.4235776.8184.734677.3042.999984.265784.482251748.84.33743.0316198858.66483248449.02.41162.2311716270.551965.560648.743175.4406.457425.10101.94125554.40836.49332980.37145.4101133.182163.2461135.184551959.520.4801531949.633.61611066.16218.224.3320.765.286611.2926156.53740.981201.139179.713981.159900.47127226.6102.01469.4370.61377579.704.839063.8411.76372.806190230.2821.255.2819254.08949.55.6743.32877.75543.996934.9099.918666.980346.67949.800106.8542249.12192.3020.2050.127596.844.35455644.60810340000000103526666674.1162029.860769.72030.724.00938241.36928.199.627124.9248.2945270.091271166.14971221.3366551.876153458.78208535.23531854000123.8576773.0686135.6212705.711674024700000054.80875.39105.0124.59879784.1587943013252371.78466769.541.643.476.0182375.4573.0424085418.6780.46579642.221129.01101953.542.591119.7942.409136.79347596.6217.406034832.9123571.6825.528152292.395.9708.3436685.7185.490776.8443.844284.154684.211552515.24.25243.9700197081.98473147498.12.42061.8111599010.551954.122748.998275.7256.497389.00101.14925754.50876.57336980.18147.3771136.854462.9611137.511952559.020.5891519.6960.333.90511108.16219.454.3120.685.270614.6105155.81910.981201.983979.686581.099931.49127081.2101.26472.8471.01032379.204.829113.1111.80902.811555228.5811.255.2619278.93946.95.6643.43107.80844.230234.6879.908615.967344.63949.871107.1108249.26192.2520.1070.127686.864.35140244.58310337666667103496666674.1158195.564111.91982.153.99305227.89829.049.566675.7145.0005348.880025227.89595258.5007432.655117733.57167474.70392496000150.9167635.0246166.4322575.751865955433333358.67817.2795.6726.30871662.2882492612122471.57454360.621.543.296.4092479.6271.4124688217.9400.46505941.441150.54100446.241.331153.7043.245134.69547428.0221.289835742.3121027.2524.747151213.176.1528.3336329.6187.610775.8643.239682.261382.486951275.14.25043.3575196805.30483047593.92.43561.4011774840.541930.327749.627876.7476.567306.47102.77625354.60546.55334579.93145.7661148.496463.8031148.327852626.420.7201517.9954.733.67111150.32219.104.3020.715.330608.5336157.21640.971190.528680.439980.819959.38126057.7101.08473.6970.89831279.624.819081.7311.82072.824814230.4401.245.2819314.04952.75.6943.28997.77344.271634.8749.898651.924346.07750.084106.5095250.49191.2920.1570.128206.834.36476744.69810340333333103490000004.1160137.362666.52072.573.96488221.16128.909.495690.0136.5411OpenBenchmarking.org

Xcompact3d Incompact3d

Input: X3D-benchmarking input.i3d

OpenBenchmarking.orgSeconds, Fewer Is BetterXcompact3d Incompact3d 2021-03-11Input: X3D-benchmarking input.i3d12c10c8c6c80160240320400SE +/- 0.14, N = 3SE +/- 0.11, N = 3SE +/- 2.69, N = 9SE +/- 4.79, N = 9125.53146.29270.09348.881. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: drivaerFastback, Medium Mesh Size - Execution Time

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenFOAM 10Input: drivaerFastback, Medium Mesh Size - Execution Time12c10c8c6c50100150200250109.54117.94166.15227.901. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lfiniteVolume -lmeshTools -lparallel -llagrangian -lregionModels -lgenericPatchFields -lOpenFOAM -ldl -lm

RELION

Test: Basic - Device: CPU

OpenBenchmarking.orgSeconds, Fewer Is BetterRELION 3.1.1Test: Basic - Device: CPU12c10c8c6c60120180240300SE +/- 1.38, N = 5SE +/- 1.86, N = 4SE +/- 2.88, N = 3SE +/- 2.59, N = 6128.10151.40221.34258.501. (CXX) g++ options: -fopenmp -std=c++0x -O3 -rdynamic -ldl -ltiff -lfftw3f -lfftw3 -lpng -lmpi_cxx -lmpi

WRF

Input: conus 2.5km

OpenBenchmarking.orgSeconds, Fewer Is BetterWRF 4.2.2Input: conus 2.5km12c10c8c6c160032004800640080004070.194563.186551.887432.661. (F9X) gfortran options: -O2 -ftree-vectorize -funroll-loops -ffree-form -fconvert=big-endian -frecord-marker=4 -fallow-invalid-boz -lesmf_time -lwrfio_nf -lnetcdff -lnetcdf -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

NAS Parallel Benchmarks

Test / Class: MG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: MG.C12c10c8c6c40K80K120K160K200KSE +/- 2393.90, N = 3SE +/- 2631.10, N = 15SE +/- 2089.98, N = 15SE +/- 1626.80, N = 15209846.76177097.42153458.78117733.571. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

NAS Parallel Benchmarks

Test / Class: SP.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: SP.C12c10c8c6c60K120K180K240K300KSE +/- 1589.72, N = 3SE +/- 726.36, N = 3SE +/- 1630.30, N = 3SE +/- 1838.44, N = 3260471.50239496.01208535.23167474.701. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

Graph500

Scale: 26

OpenBenchmarking.orgsssp median_TEPS, More Is BetterGraph500 3.0Scale: 2612c10c8c6c120M240M360M480M600M5651520005740180005318540003924960001. (CC) gcc options: -fcommon -O3 -lpthread -lm -lmpi

Neural Magic DeepSparse

Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream12c10c8c6c306090120150SE +/- 0.07, N = 3SE +/- 0.08, N = 3SE +/- 0.19, N = 3SE +/- 1.56, N = 15111.89113.41123.86150.92

Neural Magic DeepSparse

Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Detection,YOLOv5s COCO - Scenario: Asynchronous Multi-Stream12c10c8c6c2004006008001000SE +/- 0.57, N = 3SE +/- 0.53, N = 3SE +/- 1.22, N = 3SE +/- 6.69, N = 15856.02844.43773.07635.02

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream12c10c8c6c4080120160200SE +/- 0.11, N = 3SE +/- 0.43, N = 3SE +/- 0.38, N = 3SE +/- 1.66, N = 15125.72128.92135.62166.43

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream12c10c8c6c160320480640800SE +/- 0.72, N = 3SE +/- 2.41, N = 3SE +/- 2.11, N = 3SE +/- 6.13, N = 15761.49742.80705.71575.75

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgFLOP/s, More Is BetternekRS 22.0Input: TurboPipe Periodic12c10c8c6c200000M400000M600000M800000M1000000MSE +/- 9551971733.63, N = 3SE +/- 7825985326.68, N = 3SE +/- 5892587066.25, N = 3SE +/- 1934071468.29, N = 38214620000007862580000007402470000006595543333331. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -lmpi_cxx -lmpi

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU12c10c8c6c1326395265SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.57, N = 6SE +/- 0.37, N = 349.9851.2954.8058.67MIN: 38.24 / MAX: 187.97MIN: 40.28 / MAX: 292.83MIN: 40.7 / MAX: 276.86MIN: 43.56 / MAX: 315.051. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Machine Translation EN To DE FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Machine Translation EN To DE FP16 - Device: CPU12c10c8c6c2004006008001000SE +/- 2.32, N = 3SE +/- 1.48, N = 3SE +/- 8.79, N = 6SE +/- 5.14, N = 3959.16934.71875.39817.271. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

TensorFlow

Device: CPU - Batch Size: 256 - Model: ResNet-50

OpenBenchmarking.orgimages/sec, More Is BetterTensorFlow 2.10Device: CPU - Batch Size: 256 - Model: ResNet-5012c10c8c6c20406080100SE +/- 0.48, N = 3SE +/- 0.36, N = 3SE +/- 0.48, N = 3SE +/- 0.26, N = 3109.13105.91105.0195.67

GPAW

Input: Carbon Nanotube

OpenBenchmarking.orgSeconds, Fewer Is BetterGPAW 22.1Input: Carbon Nanotube12c10c8c6c612182430SE +/- 0.23, N = 5SE +/- 0.13, N = 3SE +/- 0.18, N = 3SE +/- 0.20, N = 323.1523.3724.6026.311. (CC) gcc options: -shared -fwrapv -O2 -lxc -lblas -lmpi

NAS Parallel Benchmarks

Test / Class: CG.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: CG.C12c10c8c6c20K40K60K80K100KSE +/- 812.04, N = 15SE +/- 899.80, N = 15SE +/- 907.72, N = 15SE +/- 554.69, N = 380225.0181179.0079784.1571662.281. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression Rating12c10c8c6c200K400K600K800K1000KSE +/- 6636.11, N = 3SE +/- 2580.44, N = 3SE +/- 3797.71, N = 3SE +/- 7292.38, N = 39231768934338794308249261. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVKL

Benchmark: vklBenchmark ISPC

OpenBenchmarking.orgItems / Sec, More Is BetterOpenVKL 1.3.1Benchmark: vklBenchmark ISPC12c10c8c6c30060090012001500SE +/- 6.93, N = 3SE +/- 11.03, N = 9SE +/- 8.82, N = 3SE +/- 15.59, N = 31325131713251212MIN: 329 / MAX: 4553MIN: 327 / MAX: 5660MIN: 330 / MAX: 5664MIN: 328 / MAX: 4115

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU12c10c8c6c5001000150020002500SE +/- 24.22, N = 3SE +/- 25.04, N = 15SE +/- 25.14, N = 15SE +/- 31.16, N = 32275.862325.712371.782471.57MIN: 2213.34MIN: 2171.69MIN: 2234.23MIN: 2410.731. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

NAS Parallel Benchmarks

Test / Class: LU.C

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: LU.C12c10c8c6c100K200K300K400K500KSE +/- 5489.08, N = 4SE +/- 2546.14, N = 3SE +/- 5095.33, N = 5SE +/- 4680.97, N = 5489164.65489995.20466769.54454360.621. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RTLightmap.hdr.4096x409612c10c8c6c0.37130.74261.11391.48521.8565SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 31.651.631.641.54

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 1.4.0Run: RT.hdr_alb_nrm.3840x216012c10c8c6c0.7921.5842.3763.1683.96SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 33.523.443.473.29

Rodinia

Test: OpenMP Streamcluster

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP Streamcluster12c10c8c6c246810SE +/- 0.089, N = 15SE +/- 0.079, N = 15SE +/- 0.078, N = 15SE +/- 0.050, N = 36.0016.2856.0186.4091. (CXX) g++ options: -O2 -lOpenCL

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Recurrent Neural Network Inference - Data Type: u8s8f32 - Engine: CPU12c10c8c6c5001000150020002500SE +/- 21.01, N = 3SE +/- 30.76, N = 3SE +/- 21.41, N = 3SE +/- 25.74, N = 152344.292438.002375.452479.62MIN: 2288.85MIN: 2353.97MIN: 2319.45MIN: 2293.491. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Very Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Very Fast12c10c8c6c20406080100SE +/- 0.58, N = 10SE +/- 0.74, N = 3SE +/- 1.04, N = 3SE +/- 0.77, N = 373.4475.3573.0471.411. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.0Test: Writes12c10c8c6c50K100K150K200K250KSE +/- 3742.45, N = 12SE +/- 2429.87, N = 3SE +/- 1899.17, N = 3SE +/- 2957.03, N = 3251793243603240854246882

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2022.1Implementation: MPI CPU - Input: water_GMX50_bare12c10c8c6c510152025SE +/- 0.03, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 318.7118.6818.6817.941. (CXX) g++ options: -O3

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Matrix Multiply Batch Shapes Transformer - Data Type: u8s8f32 - Engine: CPU12c10c8c6c0.10480.20960.31440.41920.524SE +/- 0.005042, N = 3SE +/- 0.005241, N = 4SE +/- 0.006374, N = 3SE +/- 0.005815, N = 30.4469300.4634540.4657960.465059MIN: 0.38MIN: 0.38MIN: 0.38MIN: 0.381. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPU12c10c8c6c1020304050SE +/- 0.32, N = 3SE +/- 0.20, N = 3SE +/- 0.01, N = 3SE +/- 0.07, N = 342.9543.1842.2241.441. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP32 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP32 - Device: CPU12c10c8c6c2004006008001000SE +/- 8.73, N = 3SE +/- 5.36, N = 3SE +/- 0.54, N = 3SE +/- 1.87, N = 31110.681104.591129.011150.54MIN: 833.53 / MAX: 1865.19MIN: 807.38 / MAX: 1818.79MIN: 850.94 / MAX: 1870.94MIN: 870.26 / MAX: 1902.461. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Monero - Hash Count: 1M12c10c8c6c20K40K60K80K100KSE +/- 328.13, N = 3SE +/- 152.19, N = 3SE +/- 383.60, N = 3SE +/- 214.10, N = 3104604.6102599.6101953.5100446.21. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU12c10c8c6c1020304050SE +/- 0.13, N = 3SE +/- 0.12, N = 3SE +/- 0.15, N = 3SE +/- 0.17, N = 342.9842.9442.5941.331. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Detection FP16 - Device: CPU12c10c8c6c2004006008001000SE +/- 3.30, N = 3SE +/- 2.71, N = 3SE +/- 3.60, N = 3SE +/- 4.42, N = 31109.451110.441119.791153.70MIN: 810.74 / MAX: 1835.01MIN: 769.04 / MAX: 1860.23MIN: 808.33 / MAX: 1875.91MIN: 853.88 / MAX: 1939.061. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed GDB GNU Debugger Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed GDB GNU Debugger Compilation 10.2Time To Compile12c10c8c6c1020304050SE +/- 0.17, N = 3SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.12, N = 341.7142.4142.4143.25

Timed Gem5 Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Gem5 Compilation 21.2Time To Compile12c10c8c6c306090120150SE +/- 0.16, N = 3SE +/- 0.36, N = 3SE +/- 0.77, N = 3SE +/- 0.57, N = 3139.24134.37136.79134.70

CockroachDB

Workload: KV, 50% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 50% Reads - Concurrency: 51212c10c8c6c11K22K33K44K55KSE +/- 464.03, N = 15SE +/- 514.54, N = 3SE +/- 454.84, N = 15SE +/- 32.88, N = 347621.949102.747596.647428.0

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Asian Dragon12c10c8c6c50100150200250SE +/- 0.13, N = 3SE +/- 0.47, N = 3SE +/- 0.39, N = 3SE +/- 0.46, N = 3213.75214.31217.41221.29MIN: 209.16 / MAX: 225.43MIN: 209.11 / MAX: 223.97MIN: 211.73 / MAX: 230.1MIN: 215.19 / MAX: 233.21

CockroachDB

Workload: KV, 10% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 10% Reads - Concurrency: 51212c10c8c6c8K16K24K32K40KSE +/- 343.66, N = 15SE +/- 270.36, N = 15SE +/- 351.71, N = 6SE +/- 438.30, N = 1535970.035993.134832.935742.3

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU12c10c8c6c30K60K90K120K150KSE +/- 1214.59, N = 3SE +/- 815.42, N = 3SE +/- 1158.58, N = 3SE +/- 681.80, N = 3119606.21122938.23123571.68121027.251. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfig12c10c8c6c612182430SE +/- 0.19, N = 11SE +/- 0.21, N = 14SE +/- 0.21, N = 9SE +/- 0.22, N = 725.5025.4125.5324.75

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU12c10c8c6c30K60K90K120K150KSE +/- 745.28, N = 3SE +/- 1134.97, N = 10SE +/- 994.61, N = 3SE +/- 365.43, N = 3147769.26147717.32152292.39151213.171. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Rodinia

Test: OpenMP CFD Solver

OpenBenchmarking.orgSeconds, Fewer Is BetterRodinia 3.1Test: OpenMP CFD Solver12c10c8c6c246810SE +/- 0.031, N = 3SE +/- 0.014, N = 3SE +/- 0.016, N = 3SE +/- 0.024, N = 36.0506.0745.9706.1521. (CXX) g++ options: -O2 -lOpenCL

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.4Blend File: BMW27 - Compute: CPU-Only12c10c8c6c246810SE +/- 0.06, N = 3SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.06, N = 38.588.428.348.33

CockroachDB

Workload: KV, 10% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 10% Reads - Concurrency: 102412c10c8c6c8K16K24K32K40KSE +/- 155.07, N = 3SE +/- 346.25, N = 3SE +/- 322.68, N = 3SE +/- 206.35, N = 336846.935776.836685.736329.6

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 3.13Binary: Pathtracer ISPC - Model: Crown12c10c8c6c4080120160200SE +/- 1.01, N = 3SE +/- 0.47, N = 3SE +/- 0.36, N = 3SE +/- 0.33, N = 3182.45184.73185.49187.61MIN: 128.42 / MAX: 209.42MIN: 137.82 / MAX: 210.21MIN: 134.45 / MAX: 211.64MIN: 146.69 / MAX: 208.25

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Ultra Fast12c10c8c6c20406080100SE +/- 0.66, N = 3SE +/- 1.02, N = 3SE +/- 0.71, N = 3SE +/- 0.63, N = 377.8377.3076.8475.861. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

OSPRay

Benchmark: particle_volume/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: particle_volume/scivis/real_time12c10c8c6c1020304050SE +/- 0.05, N = 3SE +/- 0.01, N = 3SE +/- 0.03, N = 3SE +/- 0.06, N = 342.8043.0043.8443.24

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream12c10c8c6c20406080100SE +/- 0.21, N = 3SE +/- 0.03, N = 3SE +/- 0.16, N = 3SE +/- 0.25, N = 384.2584.2784.1582.26

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream12c10c8c6c20406080100SE +/- 0.18, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.31, N = 384.3584.4884.2182.49

CockroachDB

Workload: KV, 60% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 60% Reads - Concurrency: 51212c10c8c6c11K22K33K44K55KSE +/- 268.61, N = 3SE +/- 620.92, N = 15SE +/- 411.73, N = 13SE +/- 555.56, N = 1552330.151748.852515.251275.1

libavif avifenc

Encoder Speed: 10, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 10, Lossless12c10c8c6c0.97581.95162.92743.90324.879SE +/- 0.024, N = 3SE +/- 0.055, N = 3SE +/- 0.009, N = 3SE +/- 0.043, N = 34.2414.3374.2524.2501. (CXX) g++ options: -O3 -fPIC -lm

OSPRay

Benchmark: particle_volume/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: particle_volume/ao/real_time12c10c8c6c1020304050SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.01, N = 3SE +/- 0.04, N = 343.7143.0343.9743.36

nginx

Connections: 500

OpenBenchmarking.orgRequests Per Second, More Is Betternginx 1.23.2Connections: 50012c10c8c6c40K80K120K160K200KSE +/- 291.63, N = 3SE +/- 335.64, N = 3SE +/- 453.48, N = 3SE +/- 113.87, N = 3201032.06198858.66197081.98196805.301. (CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

DaCapo Benchmark

Java Test: H2

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: H212c10c8c6c10002000300040005000SE +/- 53.17, N = 20SE +/- 39.79, N = 20SE +/- 40.50, N = 20SE +/- 36.16, N = 204802483247314830

CockroachDB

Workload: KV, 50% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 50% Reads - Concurrency: 102412c10c8c6c10K20K30K40K50KSE +/- 366.75, N = 15SE +/- 380.16, N = 3SE +/- 468.66, N = 15SE +/- 391.13, N = 947465.548449.047498.147593.9

libavif avifenc

Encoder Speed: 6

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 612c10c8c6c0.55331.10661.65992.21322.7665SE +/- 0.016, N = 3SE +/- 0.003, N = 3SE +/- 0.017, N = 3SE +/- 0.004, N = 32.4592.4112.4202.4351. (CXX) g++ options: -O3 -fPIC -lm

Kvazaar

Video Input: Bosphorus 4K - Video Preset: Medium

OpenBenchmarking.orgFrames Per Second, More Is BetterKvazaar 2.1Video Input: Bosphorus 4K - Video Preset: Medium12c10c8c6c1428425670SE +/- 0.68, N = 3SE +/- 0.11, N = 3SE +/- 0.73, N = 3SE +/- 0.53, N = 362.5662.2361.8161.401. (CC) gcc options: -pthread -ftree-vectorize -fvisibility=hidden -O2 -lpthread -lm -lrt

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression Rating12c10c8c6c300K600K900K1200K1500KSE +/- 3305.67, N = 3SE +/- 5138.86, N = 3SE +/- 9235.88, N = 3SE +/- 2020.82, N = 311814351171627115990111774841. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16 - Device: CPU12c10c8c6c0.12380.24760.37140.49520.619SE +/- 0.00, N = 3SE +/- 0.00, N = 10SE +/- 0.00, N = 3SE +/- 0.00, N = 30.550.550.550.54MIN: 0.5 / MAX: 34.71MIN: 0.5 / MAX: 41.23MIN: 0.5 / MAX: 30.68MIN: 0.5 / MAX: 34.191. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream12c10c8c6c400800120016002000SE +/- 4.95, N = 3SE +/- 1.61, N = 3SE +/- 1.56, N = 3SE +/- 8.40, N = 31964.271965.561954.121930.33

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream12c10c8c6c1122334455SE +/- 0.12, N = 3SE +/- 0.04, N = 3SE +/- 0.04, N = 3SE +/- 0.21, N = 348.7748.7449.0049.63

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 13.0Build System: Ninja12c10c8c6c20406080100SE +/- 0.23, N = 3SE +/- 0.21, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 375.6675.4475.7376.75

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPU12c10c8c6c246810SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 36.486.456.496.56MIN: 5.06 / MAX: 59.88MIN: 4.97 / MAX: 59.86MIN: 4.93 / MAX: 59.51MIN: 4.99 / MAX: 59.461. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Vehicle Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16 - Device: CPU12c10c8c6c16003200480064008000SE +/- 2.30, N = 3SE +/- 13.32, N = 3SE +/- 6.27, N = 3SE +/- 4.59, N = 37394.657425.107389.007306.471. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Node.js Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Node.js Compilation 18.8Time To Compile12c10c8c6c20406080100SE +/- 0.26, N = 3SE +/- 0.29, N = 3SE +/- 0.22, N = 3SE +/- 0.06, N = 3101.47101.94101.15102.78

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

OpenBenchmarking.orgInferences Per Minute, More Is BetterONNX Runtime 1.11Model: fcn-resnet101-11 - Device: CPU - Executor: Standard12c10c8c6c60120180240300SE +/- 2.33, N = 7SE +/- 3.09, N = 3SE +/- 2.84, N = 5SE +/- 2.17, N = 122542552572531. (CXX) g++ options: -ffunction-sections -fdata-sections -march=native -mtune=native -O3 -flto=auto -fno-fat-lto-objects -ldl -lrt

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time12c10c8c6c1224364860SE +/- 0.50, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 353.7754.4154.5154.61

simdjson

Throughput Test: TopTweet

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: TopTweet12c10c8c6c246810SE +/- 0.01, N = 3SE +/- 0.07, N = 6SE +/- 0.01, N = 3SE +/- 0.00, N = 36.596.496.576.551. (CXX) g++ options: -O3

DaCapo Benchmark

Java Test: Jython

OpenBenchmarking.orgmsec, Fewer Is BetterDaCapo Benchmark 9.12-MR1Java Test: Jython12c10c8c6c7001400210028003500SE +/- 29.26, N = 4SE +/- 18.49, N = 4SE +/- 35.24, N = 4SE +/- 21.34, N = 43380332933693345

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.4Blend File: Barbershop - Compute: CPU-Only12c10c8c6c20406080100SE +/- 0.21, N = 3SE +/- 0.15, N = 3SE +/- 0.24, N = 3SE +/- 0.31, N = 381.0380.3780.1879.93

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfig12c10c8c6c306090120150SE +/- 0.90, N = 3SE +/- 0.72, N = 3SE +/- 1.03, N = 3SE +/- 0.14, N = 3147.15145.41147.38145.77

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream12c10c8c6c2004006008001000SE +/- 0.82, N = 3SE +/- 0.20, N = 3SE +/- 0.88, N = 3SE +/- 0.67, N = 31133.281133.181136.851148.50

libavif avifenc

Encoder Speed: 0

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 012c10c8c6c1428425670SE +/- 0.18, N = 3SE +/- 0.27, N = 3SE +/- 0.03, N = 3SE +/- 0.47, N = 363.2563.2562.9663.801. (CXX) g++ options: -O3 -fPIC -lm

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream12c10c8c6c2004006008001000SE +/- 1.25, N = 3SE +/- 1.00, N = 3SE +/- 1.67, N = 3SE +/- 1.05, N = 31133.481135.181137.511148.33

CockroachDB

Workload: KV, 60% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 60% Reads - Concurrency: 102412c10c8c6c11K22K33K44K55KSE +/- 239.52, N = 3SE +/- 400.61, N = 10SE +/- 447.89, N = 3SE +/- 448.33, N = 352573.351959.552559.052626.4

Timed Apache Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Apache Compilation 2.4.41Time To Compile12c10c8c6c510152025SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 320.4620.4820.5920.72

NWChem

Input: C240 Buckyball

OpenBenchmarking.orgSeconds, Fewer Is BetterNWChem 7.0.2Input: C240 Buckyball12c10c8c6c300600900120015001537.11531.01519.61517.91. (F9X) gfortran options: -lnwctask -lccsd -lmcscf -lselci -lmp2 -lmoints -lstepper -ldriver -loptim -lnwdft -lgradients -lcphf -lesp -lddscf -ldangchang -lguess -lhessian -lvib -lnwcutil -lrimp2 -lproperty -lsolvation -lnwints -lprepar -lnwmd -lnwpw -lofpw -lpaw -lpspw -lband -lnwpwlib -lcafe -lspace -lanalyze -lqhop -lpfft -ldplot -ldrdy -lvscf -lqmmm -lqmd -letrans -ltce -lbq -lmm -lcons -lperfm -ldntmc -lccca -ldimqm -lga -larmci -lpeigs -l64to32 -lopenblas -lpthread -lrt -llapack -lnwcblas -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz -lcomex -m64 -ffast-math -std=legacy -fdefault-integer-8 -finline-functions -O2

CockroachDB

Workload: MoVR - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: MoVR - Concurrency: 51212c10c8c6c2004006008001000SE +/- 3.38, N = 3SE +/- 3.66, N = 3SE +/- 9.03, N = 3SE +/- 4.87, N = 3948.5949.6960.3954.7

Timed Godot Game Engine Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Godot Game Engine Compilation 3.2.3Time To Compile12c10c8c6c816243240SE +/- 0.40, N = 4SE +/- 0.04, N = 3SE +/- 0.19, N = 3SE +/- 0.11, N = 334.0333.6233.9133.67

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU12c10c8c6c2K4K6K8K10KSE +/- 1.42, N = 3SE +/- 3.30, N = 3SE +/- 1.79, N = 3SE +/- 1.79, N = 311018.3711066.1611108.1611150.321. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenRadioss

Model: Bird Strike on Windshield

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Bird Strike on Windshield12c10c8c6c50100150200250SE +/- 0.38, N = 3SE +/- 0.54, N = 3SE +/- 0.19, N = 3SE +/- 0.14, N = 3216.88218.22219.45219.10

OpenVINO

Model: Vehicle Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Vehicle Detection FP16-INT8 - Device: CPU12c10c8c6c0.97881.95762.93643.91524.894SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.354.334.314.30MIN: 3.52 / MAX: 41.44MIN: 3.51 / MAX: 41.25MIN: 3.51 / MAX: 43.89MIN: 3.52 / MAX: 43.571. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.4Blend File: Classroom - Compute: CPU-Only12c10c8c6c510152025SE +/- 0.00, N = 3SE +/- 0.09, N = 3SE +/- 0.06, N = 3SE +/- 0.04, N = 320.9220.7620.6820.71

libavif avifenc

Encoder Speed: 6, Lossless

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 6, Lossless12c10c8c6c1.19932.39863.59794.79725.9965SE +/- 0.076, N = 3SE +/- 0.044, N = 3SE +/- 0.034, N = 3SE +/- 0.055, N = 35.2875.2865.2705.3301. (CXX) g++ options: -O3 -fPIC -lm

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream12c10c8c6c130260390520650SE +/- 1.72, N = 3SE +/- 2.48, N = 3SE +/- 1.32, N = 3SE +/- 2.24, N = 3615.45611.29614.61608.53

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream12c10c8c6c306090120150SE +/- 0.46, N = 3SE +/- 0.55, N = 3SE +/- 0.27, N = 3SE +/- 0.58, N = 3155.48156.54155.82157.22

OpenVINO

Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Age Gender Recognition Retail 0013 FP16-INT8 - Device: CPU12c10c8c6c0.22050.4410.66150.8821.1025SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 30.970.980.980.97MIN: 0.85 / MAX: 22.9MIN: 0.85 / MAX: 39.82MIN: 0.86 / MAX: 39.58MIN: 0.86 / MAX: 33.821. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream12c10c8c6c30060090012001500SE +/- 4.04, N = 3SE +/- 0.69, N = 3SE +/- 3.22, N = 3SE +/- 1.21, N = 31195.911201.141201.981190.53

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.1Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream12c10c8c6c20406080100SE +/- 0.27, N = 3SE +/- 0.03, N = 3SE +/- 0.20, N = 3SE +/- 0.07, N = 380.0879.7179.6980.44

OpenRadioss

Model: INIVOL and Fluid Structure Interaction Drop Container

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: INIVOL and Fluid Structure Interaction Drop Container12c10c8c6c20406080100SE +/- 0.14, N = 3SE +/- 0.08, N = 3SE +/- 0.12, N = 3SE +/- 0.08, N = 381.5781.1581.0980.81

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPU12c10c8c6c2K4K6K8K10KSE +/- 2.57, N = 3SE +/- 2.08, N = 3SE +/- 7.50, N = 3SE +/- 3.42, N = 39867.419900.479931.499959.381. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.18.1Variant: Wownero - Hash Count: 1M12c10c8c6c30K60K90K120K150KSE +/- 849.90, N = 3SE +/- 70.55, N = 3SE +/- 122.05, N = 3SE +/- 349.73, N = 3126465.6127226.6127081.2126057.71. (CXX) g++ options: -fexceptions -fno-rtti -maes -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPU12c10c8c6c20406080100SE +/- 0.08, N = 3SE +/- 0.03, N = 3SE +/- 0.04, N = 3SE +/- 0.06, N = 3101.74102.01101.26101.081. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16 - Device: CPU12c10c8c6c100200300400500SE +/- 0.21, N = 3SE +/- 0.10, N = 3SE +/- 0.27, N = 3SE +/- 0.14, N = 3470.98469.43472.84473.69MIN: 451.07 / MAX: 556.04MIN: 432.92 / MAX: 555.25MIN: 394.37 / MAX: 553.15MIN: 423.34 / MAX: 579.411. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point Rate12c10c8c6c1632486480SE +/- 0.33, N = 3SE +/- 0.02, N = 3SE +/- 0.05, N = 3SE +/- 0.13, N = 370.4170.6171.0170.901. (CC) gcc options: -O3 -march=native -fopenmp

OpenRadioss

Model: Bumper Beam

OpenBenchmarking.orgSeconds, Fewer Is BetterOpenRadioss 2022.10.13Model: Bumper Beam12c10c8c6c20406080100SE +/- 0.79, N = 3SE +/- 0.75, N = 3SE +/- 0.70, N = 3SE +/- 0.71, N = 379.8679.7079.2079.62

OpenVINO

Model: Weld Porosity Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16 - Device: CPU12c10c8c6c1.09132.18263.27394.36525.4565SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 34.854.834.824.81MIN: 4.06 / MAX: 28.62MIN: 4.08 / MAX: 28.68MIN: 3.98 / MAX: 28.83MIN: 4.14 / MAX: 27.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU12c10c8c6c2K4K6K8K10KSE +/- 9.96, N = 3SE +/- 5.19, N = 3SE +/- 2.85, N = 3SE +/- 7.67, N = 39038.479063.849113.119081.731. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

ASTC Encoder

Preset: Exhaustive

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Exhaustive12c10c8c6c3691215SE +/- 0.03, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 311.7311.7611.8111.821. (CXX) g++ options: -O3 -flto -pthread

Stargate Digital Audio Workstation

Sample Rate: 192000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 192000 - Buffer Size: 102412c10c8c6c0.63651.2731.90952.5463.1825SE +/- 0.001919, N = 3SE +/- 0.017291, N = 3SE +/- 0.019484, N = 3SE +/- 0.004057, N = 32.8290612.8061902.8115552.8248141. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: particle_volume/pathtracer/real_time12c10c8c6c50100150200250SE +/- 1.54, N = 3SE +/- 1.94, N = 3SE +/- 1.74, N = 3SE +/- 0.59, N = 3229.27230.28228.58230.44

simdjson

Throughput Test: LargeRandom

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: LargeRandom12c10c8c6c0.28130.56260.84391.12521.4065SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 31.251.251.251.241. (CXX) g++ options: -O3

OpenVINO

Model: Person Vehicle Bike Detection FP16 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Person Vehicle Bike Detection FP16 - Device: CPU12c10c8c6c1.19252.3853.57754.775.9625SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 3SE +/- 0.00, N = 35.305.285.265.28MIN: 4.42 / MAX: 40.66MIN: 4.37 / MAX: 41.23MIN: 4.42 / MAX: 42.93MIN: 4.34 / MAX: 38.931. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU12c10c8c6c4K8K12K16K20KSE +/- 12.43, N = 3SE +/- 30.88, N = 3SE +/- 31.30, N = 3SE +/- 33.95, N = 319171.5119254.0819278.9319314.041. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

CockroachDB

Workload: MoVR - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: MoVR - Concurrency: 102412c10c8c6c2004006008001000SE +/- 1.42, N = 3SE +/- 0.58, N = 3SE +/- 3.18, N = 3SE +/- 1.56, N = 3953.8949.5946.9952.7

simdjson

Throughput Test: PartialTweets

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: PartialTweets12c10c8c6c1.28032.56063.84095.12126.4015SE +/- 0.01, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.01, N = 35.655.675.665.691. (CXX) g++ options: -O3

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/scivis/real_time12c10c8c6c1020304050SE +/- 0.15, N = 3SE +/- 0.12, N = 3SE +/- 0.13, N = 3SE +/- 0.15, N = 343.1343.3343.4343.29

Timed MPlayer Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed MPlayer Compilation 1.5Time To Compile12c10c8c6c246810SE +/- 0.033, N = 3SE +/- 0.034, N = 3SE +/- 0.023, N = 3SE +/- 0.010, N = 37.7777.7557.8087.773

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.10Benchmark: gravity_spheres_volume/dim_512/ao/real_time12c10c8c6c1020304050SE +/- 0.13, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3SE +/- 0.07, N = 343.9844.0044.2344.27

libavif avifenc

Encoder Speed: 2

OpenBenchmarking.orgSeconds, Fewer Is Betterlibavif avifenc 0.11Encoder Speed: 212c10c8c6c816243240SE +/- 0.14, N = 3SE +/- 0.08, N = 3SE +/- 0.10, N = 3SE +/- 0.14, N = 334.8534.9134.6934.871. (CXX) g++ options: -O3 -fPIC -lm

OpenVINO

Model: Weld Porosity Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Weld Porosity Detection FP16-INT8 - Device: CPU12c10c8c6c3691215SE +/- 0.00, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.02, N = 39.959.919.909.89MIN: 8.42 / MAX: 52.38MIN: 8.4 / MAX: 50.42MIN: 8.39 / MAX: 56.99MIN: 8.35 / MAX: 32.161. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgGFInst/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM212c10c8c6c2K4K6K8K10KSE +/- 27.15, N = 3SE +/- 31.49, N = 3SE +/- 63.13, N = 3SE +/- 96.81, N = 38640.318666.988615.978651.921. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM2

OpenBenchmarking.orgBillion Interactions/s, More Is BetterminiBUDE 20210901Implementation: OpenMP - Input Deck: BM212c10c8c6c80160240320400SE +/- 1.09, N = 3SE +/- 1.26, N = 3SE +/- 2.53, N = 3SE +/- 3.87, N = 3345.61346.68344.64346.081. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

Build2

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterBuild2 0.13Time To Compile12c10c8c6c1122334455SE +/- 0.04, N = 3SE +/- 0.02, N = 3SE +/- 0.20, N = 3SE +/- 0.28, N = 349.9249.8049.8750.08

ASTC Encoder

Preset: Thorough

OpenBenchmarking.orgMT/s, More Is BetterASTC Encoder 4.0Preset: Thorough12c10c8c6c20406080100SE +/- 0.05, N = 3SE +/- 0.05, N = 3SE +/- 0.04, N = 3SE +/- 0.10, N = 3106.57106.85107.11106.511. (CXX) g++ options: -O3 -flto -pthread

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgms, Fewer Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU12c10c8c6c50100150200250SE +/- 0.32, N = 3SE +/- 0.03, N = 3SE +/- 0.69, N = 3SE +/- 0.13, N = 3250.34249.12249.26250.49MIN: 222.95 / MAX: 301.42MIN: 209.28 / MAX: 311.3MIN: 207.76 / MAX: 340.53MIN: 213.3 / MAX: 307.841. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

OpenVINO

Model: Face Detection FP16-INT8 - Device: CPU

OpenBenchmarking.orgFPS, More Is BetterOpenVINO 2022.3Model: Face Detection FP16-INT8 - Device: CPU12c10c8c6c4080120160200SE +/- 0.21, N = 3SE +/- 0.03, N = 3SE +/- 0.48, N = 3SE +/- 0.09, N = 3191.43192.30192.25191.291. (CXX) g++ options: -isystem -fsigned-char -ffunction-sections -fdata-sections -msse4.1 -msse4.2 -O3 -fno-strict-overflow -fwrapv -fPIC -fvisibility=hidden -Os -std=c++11 -MD -MT -MF

Timed Mesa Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Mesa Compilation 21.0Time To Compile12c10c8c6c510152025SE +/- 0.07, N = 3SE +/- 0.06, N = 3SE +/- 0.07, N = 3SE +/- 0.05, N = 320.1220.2120.1120.16

NAMD

ATPase Simulation - 327,506 Atoms

OpenBenchmarking.orgdays/ns, Fewer Is BetterNAMD 2.14ATPase Simulation - 327,506 Atoms12c10c8c6c0.02880.05760.08640.11520.144SE +/- 0.00009, N = 3SE +/- 0.00007, N = 3SE +/- 0.00046, N = 3SE +/- 0.00009, N = 30.127830.127590.127680.12820

simdjson

Throughput Test: DistinctUserID

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: DistinctUserID12c10c8c6c246810SE +/- 0.02, N = 3SE +/- 0.02, N = 3SE +/- 0.01, N = 3SE +/- 0.02, N = 36.866.846.866.831. (CXX) g++ options: -O3

Stargate Digital Audio Workstation

Sample Rate: 96000 - Buffer Size: 1024

OpenBenchmarking.orgRender Ratio, More Is BetterStargate Digital Audio Workstation 22.11.5Sample Rate: 96000 - Buffer Size: 102412c10c8c6c0.98211.96422.94633.92844.9105SE +/- 0.023689, N = 3SE +/- 0.010431, N = 3SE +/- 0.008144, N = 3SE +/- 0.002133, N = 34.3458904.3545564.3514024.3647671. (CXX) g++ options: -lpthread -lsndfile -lm -O3 -march=native -ffast-math -funroll-loops -fstrength-reduce -fstrict-aliasing -finline-functions

Timed PHP Compilation

Time To Compile

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed PHP Compilation 8.1.9Time To Compile12c10c8c6c1020304050SE +/- 0.06, N = 3SE +/- 0.08, N = 3SE +/- 0.04, N = 3SE +/- 0.07, N = 344.5244.6144.5844.70

Liquid-DSP

Threads: 256 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 256 - Buffer Length: 256 - Filter Length: 5712c10c8c6c2000M4000M6000M8000M10000MSE +/- 4618802.15, N = 3SE +/- 5196152.42, N = 3SE +/- 4333333.33, N = 3SE +/- 3844187.53, N = 3103470000001034000000010337666667103403333331. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 384 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 2021.01.31Threads: 384 - Buffer Length: 256 - Filter Length: 5712c10c8c6c2000M4000M6000M8000M10000MSE +/- 4582575.69, N = 3SE +/- 4409585.52, N = 3SE +/- 5783117.19, N = 3SE +/- 3214550.25, N = 3103470000001035266666710349666667103490000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

simdjson

Throughput Test: Kostya

OpenBenchmarking.orgGB/s, More Is Bettersimdjson 2.0Throughput Test: Kostya12c10c8c6c0.92481.84962.77443.69924.624SE +/- 0.01, N = 3SE +/- 0.01, N = 3SE +/- 0.00, N = 3SE +/- 0.01, N = 34.114.114.114.111. (CXX) g++ options: -O3

CockroachDB

Workload: KV, 95% Reads - Concurrency: 1024

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 95% Reads - Concurrency: 102412c10c8c6c14K28K42K56K70KSE +/- 575.30, N = 3SE +/- 1142.40, N = 15SE +/- 1317.65, N = 15SE +/- 1310.27, N = 1564661.862029.858195.560137.3

CockroachDB

Workload: KV, 95% Reads - Concurrency: 512

OpenBenchmarking.orgops/s, More Is BetterCockroachDB 22.2Workload: KV, 95% Reads - Concurrency: 51212c10c8c6c14K28K42K56K70KSE +/- 702.29, N = 3SE +/- 1044.13, N = 15SE +/- 890.57, N = 3SE +/- 813.26, N = 1564467.660769.764111.962666.5

oneDNN

Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: Recurrent Neural Network Training - Data Type: u8s8f32 - Engine: CPU12c10c8c6c400800120016002000SE +/- 31.84, N = 15SE +/- 14.89, N = 3SE +/- 28.30, N = 3SE +/- 16.27, N = 101968.702030.721982.152072.57MIN: 1632.62MIN: 1981.15MIN: 1911.33MIN: 1942.141. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

OpenBenchmarking.orgms, Fewer Is BetteroneDNN 3.0Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU12c10c8c6c0.90211.80422.70633.60844.5105SE +/- 0.02537, N = 3SE +/- 0.05885, N = 12SE +/- 0.08932, N = 12SE +/- 0.01788, N = 33.954714.009383.993053.96488MIN: 3.05MIN: 2.96MIN: 2.67MIN: 2.991. (CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl -lpthread

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.4Encoder Mode: Preset 12 - Input: Bosphorus 4K12c10c8c6c60120180240300SE +/- 7.35, N = 15SE +/- 7.16, N = 15SE +/- 7.53, N = 15SE +/- 9.18, N = 13251.77241.37227.90221.16

LuxCoreRender

Scene: Orange Juice - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Orange Juice - Acceleration: CPU12c10c8c6c714212835SE +/- 0.63, N = 15SE +/- 0.29, N = 3SE +/- 0.72, N = 15SE +/- 0.71, N = 1528.8228.1929.0428.90MIN: 23.01 / MAX: 45.86MIN: 23.3 / MAX: 45.65MIN: 22.62 / MAX: 45.48MIN: 22.4 / MAX: 44.91

LuxCoreRender

Scene: Danish Mood - Acceleration: CPU

OpenBenchmarking.orgM samples/sec, More Is BetterLuxCoreRender 2.6Scene: Danish Mood - Acceleration: CPU12c10c8c6c3691215SE +/- 0.09, N = 15SE +/- 0.17, N = 12SE +/- 0.11, N = 15SE +/- 0.14, N = 129.699.629.569.49MIN: 4 / MAX: 12.39MIN: 3.97 / MAX: 12.9MIN: 3.94 / MAX: 12.41MIN: 3.85 / MAX: 12.15

NAS Parallel Benchmarks

Test / Class: IS.D

OpenBenchmarking.orgTotal Mop/s, More Is BetterNAS Parallel Benchmarks 3.4Test / Class: IS.D12c10c8c6c2K4K6K8K10KSE +/- 84.88, N = 3SE +/- 206.91, N = 12SE +/- 134.50, N = 15SE +/- 158.57, N = 128491.017124.926675.715690.011. (F9X) gfortran options: -O3 -march=native -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz 2. Open MPI 4.1.4

High Performance Conjugate Gradient

OpenBenchmarking.orgGFLOP/s, More Is BetterHigh Performance Conjugate Gradient 3.112c10c8c6c20406080100SE +/- 1.12, N = 12SE +/- 3.31, N = 9SE +/- 0.49, N = 9SE +/- 0.99, N = 986.8148.2945.0036.541. (CXX) g++ options: -O3 -ffast-math -ftree-vectorize -lmpi_cxx -lmpi


Phoronix Test Suite v10.8.4