Google Cloud c3 Sapphire Rapids vs. AMD Milan

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2303280-NE-2303286NE24.

Google Cloud c3 Sapphire Rapids vs. AMD Milan: system details

Fields: Processor, Motherboard, Chipset, Memory, Disk, Network, OS, Kernel, Vulkan, Compiler, File-System, System Layer. The export gives a full row for c3-highcpu-8 SPR; for each later instance only the fields printed in the export are listed.

c3-highcpu-8 SPR
  Processor: Intel Xeon Platinum 8481C (4 Cores / 8 Threads)
  Motherboard: Google Compute Engine c3-highcpu-8
  Chipset: Intel 440FX 82441FX PMC
  Memory: 16GB
  Disk: 322GB nvme_card-pd
  Network: Google Compute Engine Virtual
  OS: Ubuntu 22.10
  Kernel: 5.19.0-1015-gcp (x86_64)
  Vulkan: 1.3.224
  Compiler: GCC 12.2.0
  File-System: ext4
  System Layer: KVM

c2-standard-8 CLX
  Processor: Intel Xeon (4 Cores / 8 Threads)
  Motherboard: Google Compute Engine c2-standard-8
  Memory: 32GB
  Disk: 322GB PersistentDisk
  Network: Red Hat Virtio device

n2-standard-8 CLX
  Motherboard: Google Compute Engine n2-standard-8

n2-highcpu-8 CLX
  Motherboard: Google Compute Engine n2-highcpu-8
  Memory: 8GB

t2d-standard-8 AMD
  Processor: AMD EPYC 7B13 (8 Cores)
  Motherboard: Google Compute Engine t2d-standard-8
  Memory: 32GB

c2d-highcpu-8 AMD
  Processor: AMD EPYC 7B13 (4 Cores / 8 Threads)
  Motherboard: Google Compute Engine c2d-highcpu-8
  Memory: 16GB

Kernel Details
  Transparent Huge Pages: madvise

Compiler Details
  --build=x86_64-linux-gnu --disable-vtable-verify --disable-werror --enable-cet --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-multiarch --enable-multilib --enable-nls --enable-objc-gc=auto --enable-offload-defaulted --enable-offload-targets=nvptx-none=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-nvptx/usr,amdgcn-amdhsa=/build/gcc-12-U8K4Qv/gcc-12-12.2.0/debian/tmp-gcn/usr --enable-plugin --enable-shared --enable-threads=posix --host=x86_64-linux-gnu --program-prefix=x86_64-linux-gnu- --target=x86_64-linux-gnu --with-abi=m64 --with-arch-32=i686 --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-multilib-list=m32,m64,mx32 --with-target-system-zlib=auto --with-tune=generic --without-cuda-driver -v

Processor Details
  CPU Microcode: 0xffffffff

Python Details
  Python 3.10.7

Security Details
  c3-highcpu-8 SPR: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Not affected
  c2-standard-8 CLX: itlb_multihit: Not affected + l1tf: Not affected + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Not affected + mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + retbleed: Mitigation of Enhanced IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT Host state unknown
  n2-standard-8 CLX: itlb_multihit: Not affected + l1tf: Not affected + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Not affected + mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + retbleed: Mitigation of Enhanced IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT Host state unknown
  n2-highcpu-8 CLX: itlb_multihit: Not affected + l1tf: Not affected + mds: Mitigation of Clear buffers; SMT Host state unknown + meltdown: Not affected + mmio_stale_data: Vulnerable: Clear buffers attempted no microcode; SMT Host state unknown + retbleed: Mitigation of Enhanced IBRS + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Enhanced IBRS IBPB: conditional RSB filling PBRSB-eIBRS: SW sequence + srbds: Not affected + tsx_async_abort: Mitigation of Clear buffers; SMT Host state unknown
  t2d-standard-8 AMD: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: disabled RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected
  c2d-highcpu-8 AMD: itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: conditional RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

Google Cloud c3 Sapphire Rapids vs. AMD Milan: result overview table (one row per instance, one column per test). The cell values are fused in this export; the same per-test values are listed individually in the sections that follow.

LeelaChessZero

Backend: BLAS

Nodes Per Second, More Is Better (LeelaChessZero 0.28)
  c3-highcpu-8 SPR: 1272 (SE +/- 16.59, N = 3)
  c2-standard-8 CLX: 909 (SE +/- 7.36, N = 3)
  n2-standard-8 CLX: 810 (SE +/- 3.18, N = 3)
  n2-highcpu-8 CLX: 770 (SE +/- 6.44, N = 3)
  t2d-standard-8 AMD: 1202 (SE +/- 1.45, N = 3)
  c2d-highcpu-8 AMD: 856 (SE +/- 4.70, N = 3)
1. (CXX) g++ options: -flto -pthread
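As a reading aid, the BLAS nodes-per-second figures above can be normalized against one instance. The sketch below is illustrative only: the choice of c3-highcpu-8 SPR as the baseline and the variable names are assumptions, not part of the exported result.

```python
# LeelaChessZero 0.28 (BLAS) nodes per second, copied from the chart above.
blas_nps = {
    "c3-highcpu-8 SPR": 1272,
    "c2-standard-8 CLX": 909,
    "n2-standard-8 CLX": 810,
    "n2-highcpu-8 CLX": 770,
    "t2d-standard-8 AMD": 1202,
    "c2d-highcpu-8 AMD": 856,
}

# Assumed baseline for the comparison (any instance could be used instead).
baseline = blas_nps["c3-highcpu-8 SPR"]

# "More Is Better" metric: a ratio above 1.0 means faster than the baseline.
relative = {name: nps / baseline for name, nps in blas_nps.items()}

for name, ratio in relative.items():
    print(f"{name}: {ratio:.3f}x of baseline")
```

For "Fewer Is Better" charts the ratio reads the other way round: a value below 1.0 means the instance finished faster than the baseline.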

LeelaChessZero

Backend: Eigen

Nodes Per Second, More Is Better (LeelaChessZero 0.28)
  c3-highcpu-8 SPR: 1221 (SE +/- 7.69, N = 3)
  c2-standard-8 CLX: 902 (SE +/- 9.00, N = 3)
  n2-standard-8 CLX: 808 (SE +/- 8.78, N = 5)
  n2-highcpu-8 CLX: 801 (SE +/- 8.29, N = 3)
  t2d-standard-8 AMD: 1039 (SE +/- 3.06, N = 3)
  c2d-highcpu-8 AMD: 743 (SE +/- 3.48, N = 3)
1. (CXX) g++ options: -flto -pthread

miniBUDE

Implementation: OpenMP - Input Deck: BM1

GFInst/s, More Is Better (miniBUDE 20210901)
  c3-highcpu-8 SPR: 188.65 (SE +/- 0.03, N = 3)
  c2-standard-8 CLX: 152.80 (SE +/- 0.02, N = 3)
  n2-standard-8 CLX: 133.95 (SE +/- 0.05, N = 3)
  n2-highcpu-8 CLX: 133.95 (SE +/- 0.05, N = 3)
  t2d-standard-8 AMD: 304.05 (SE +/- 0.03, N = 3)
  c2d-highcpu-8 AMD: 186.08 (SE +/- 0.03, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

miniBUDE

Implementation: OpenMP - Input Deck: BM1

Billion Interactions/s, More Is Better (miniBUDE 20210901)
  c3-highcpu-8 SPR: 7.546 (SE +/- 0.001, N = 3)
  c2-standard-8 CLX: 6.112 (SE +/- 0.001, N = 3)
  n2-standard-8 CLX: 5.358 (SE +/- 0.002, N = 3)
  n2-highcpu-8 CLX: 5.358 (SE +/- 0.002, N = 3)
  t2d-standard-8 AMD: 12.162 (SE +/- 0.001, N = 3)
  c2d-highcpu-8 AMD: 7.443 (SE +/- 0.001, N = 3)
1. (CC) gcc options: -std=c99 -Ofast -ffast-math -fopenmp -march=native -lm

NAMD

ATPase Simulation - 327,506 Atoms

days/ns, Fewer Is Better (NAMD 2.14)
  c3-highcpu-8 SPR: 3.35779 (SE +/- 0.00254, N = 3)
  c2-standard-8 CLX: 4.44534 (SE +/- 0.00771, N = 3)
  n2-standard-8 CLX: 5.06706 (SE +/- 0.03361, N = 3)
  n2-highcpu-8 CLX: 5.05544 (SE +/- 0.01808, N = 3)
  t2d-standard-8 AMD: 3.10822 (SE +/- 0.00017, N = 3)
  c2d-highcpu-8 AMD: 4.45765 (SE +/- 0.00235, N = 3)
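NAMD reports days per simulated nanosecond, so lower is better. Readers who prefer the throughput form (ns/day) can take the reciprocal; the short sketch below does that for the values above. The variable names are illustrative and not part of the export.

```python
# NAMD 2.14 ATPase results in days/ns, copied from the chart above.
days_per_ns = {
    "c3-highcpu-8 SPR": 3.35779,
    "c2-standard-8 CLX": 4.44534,
    "n2-standard-8 CLX": 5.06706,
    "n2-highcpu-8 CLX": 5.05544,
    "t2d-standard-8 AMD": 3.10822,
    "c2d-highcpu-8 AMD": 4.45765,
}

# ns/day is the reciprocal of days/ns; here a higher value is better.
ns_per_day = {name: 1.0 / value for name, value in days_per_ns.items()}

for name, value in sorted(ns_per_day.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {value:.4f} ns/day")
```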

nekRS

Input: TurboPipe Periodic

FLOP/s, More Is Better (nekRS 22.0); the export lists no n2-highcpu-8 CLX result for this test
  c3-highcpu-8 SPR: 30667900000 (SE +/- 92013205.57, N = 3)
  c2-standard-8 CLX: 25100766667 (SE +/- 56849518.71, N = 3)
  n2-standard-8 CLX: 22963566667 (SE +/- 62364769.26, N = 3)
  t2d-standard-8 AMD: 40756366667 (SE +/- 64451851.11, N = 3)
  c2d-highcpu-8 AMD: 27115200000 (SE +/- 48942108.66, N = 3)
1. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -lmpi_cxx -lmpi

Xcompact3d Incompact3d

Input: input.i3d 129 Cells Per Direction

Seconds, Fewer Is Better (Xcompact3d Incompact3d 2021-03-11)
  c3-highcpu-8 SPR: 32.39 (SE +/- 0.02, N = 3)
  c2-standard-8 CLX: 37.76 (SE +/- 0.02, N = 3)
  n2-standard-8 CLX: 40.54 (SE +/- 0.12, N = 3)
  n2-highcpu-8 CLX: 40.03 (SE +/- 0.04, N = 3)
  t2d-standard-8 AMD: 21.50 (SE +/- 0.24, N = 4)
  c2d-highcpu-8 AMD: 26.52 (SE +/- 0.05, N = 3)
1. (F9X) gfortran options: -cpp -O2 -funroll-loops -floop-optimize -fcray-pointer -fbacktrace -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Mesh Time

Seconds, Fewer Is Better (OpenFOAM 10)
  c3-highcpu-8 SPR: 62.04
  c2-standard-8 CLX: 85.52
  n2-standard-8 CLX: 95.06
  n2-highcpu-8 CLX: 91.51
  t2d-standard-8 AMD: 48.75
  c2d-highcpu-8 AMD: 63.92
Additional per-run link options shown in the export without a clear instance attribution: -lphysicalProperties -lspecie -lfiniteVolume -lfvModels -lmeshTools -lsampling (listed four times), -ldynamicMesh (listed twice)
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenFOAM

Input: drivaerFastback, Small Mesh Size - Execution Time

Seconds, Fewer Is Better (OpenFOAM 10)
  c3-highcpu-8 SPR: 422.68
  c2-standard-8 CLX: 560.61
  n2-standard-8 CLX: 574.24
  n2-highcpu-8 CLX: 563.87
  t2d-standard-8 AMD: 294.34
  c2d-highcpu-8 AMD: 403.83
Additional per-run link options shown in the export without a clear instance attribution: -lphysicalProperties -lspecie -lfiniteVolume -lfvModels -lmeshTools -lsampling (listed four times), -ldynamicMesh (listed twice)
1. (CXX) g++ options: -std=c++14 -m64 -O3 -ftemplate-depth-100 -fPIC -fuse-ld=bfd -Xlinker --add-needed --no-as-needed -lgenericPatchFields -lOpenFOAM -ldl -lm

OpenRadioss

Model: Bumper Beam

Seconds, Fewer Is Better (OpenRadioss 2022.10.13)
  c3-highcpu-8 SPR: 303.38 (SE +/- 0.51, N = 3)
  c2-standard-8 CLX: 390.61 (SE +/- 0.90, N = 3)
  n2-standard-8 CLX: 436.27 (SE +/- 2.04, N = 3)
  n2-highcpu-8 CLX: 433.87 (SE +/- 2.21, N = 3)
  t2d-standard-8 AMD: 176.83 (SE +/- 0.34, N = 3)
  c2d-highcpu-8 AMD: 286.02 (SE +/- 0.29, N = 3)

OpenRadioss

Model: Cell Phone Drop Test

Seconds, Fewer Is Better (OpenRadioss 2022.10.13)
  c3-highcpu-8 SPR: 219.73 (SE +/- 0.38, N = 3)
  c2-standard-8 CLX: 291.49 (SE +/- 0.39, N = 3)
  n2-standard-8 CLX: 322.22 (SE +/- 0.36, N = 3)
  n2-highcpu-8 CLX: 320.36 (SE +/- 0.68, N = 3)
  t2d-standard-8 AMD: 135.99 (SE +/- 0.30, N = 3)
  c2d-highcpu-8 AMD: 213.88 (SE +/- 2.62, N = 4)

OpenRadioss

Model: Bird Strike on Windshield

Seconds, Fewer Is Better (OpenRadioss 2022.10.13)
  c3-highcpu-8 SPR: 595.14 (SE +/- 0.30, N = 3)
  c2-standard-8 CLX: 724.80 (SE +/- 1.51, N = 3)
  n2-standard-8 CLX: 811.76 (SE +/- 5.07, N = 3)
  n2-highcpu-8 CLX: 800.02 (SE +/- 1.32, N = 3)
  t2d-standard-8 AMD: 386.35 (SE +/- 0.51, N = 3)
  c2d-highcpu-8 AMD: 621.88 (SE +/- 0.64, N = 3)

OpenRadioss

Model: Rubber O-Ring Seal Installation

Seconds, Fewer Is Better (OpenRadioss 2022.10.13)
  c3-highcpu-8 SPR: 367.00 (SE +/- 0.20, N = 3)
  c2-standard-8 CLX: 523.45 (SE +/- 0.45, N = 3)
  n2-standard-8 CLX: 591.71 (SE +/- 7.04, N = 3)
  n2-highcpu-8 CLX: 584.09 (SE +/- 1.46, N = 3)
  t2d-standard-8 AMD: 208.81 (SE +/- 0.50, N = 3)
  c2d-highcpu-8 AMD: 364.33 (SE +/- 0.93, N = 3)

SPECFEM3D

Model: Mount St. Helens

Seconds, Fewer Is Better (SPECFEM3D 4.0)
  c3-highcpu-8 SPR: 139.52 (SE +/- 0.10, N = 3)
  c2-standard-8 CLX: 145.38 (SE +/- 0.56, N = 3)
  n2-standard-8 CLX: 162.43 (SE +/- 0.57, N = 3)
  n2-highcpu-8 CLX: 160.30 (SE +/- 0.25, N = 3)
  t2d-standard-8 AMD: 81.19 (SE +/- 0.46, N = 3)
  c2d-highcpu-8 AMD: 125.62 (SE +/- 0.89, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Layered Halfspace

Seconds, Fewer Is Better (SPECFEM3D 4.0)
  c3-highcpu-8 SPR: 372.09 (SE +/- 0.64, N = 3)
  c2-standard-8 CLX: 374.76 (SE +/- 2.51, N = 3)
  n2-standard-8 CLX: 415.18 (SE +/- 0.67, N = 3)
  n2-highcpu-8 CLX: 416.41 (SE +/- 0.99, N = 3)
  t2d-standard-8 AMD: 223.11 (SE +/- 3.19, N = 3)
  c2d-highcpu-8 AMD: 324.85 (SE +/- 1.43, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Tomographic Model

Seconds, Fewer Is Better (SPECFEM3D 4.0)
  c3-highcpu-8 SPR: 143.94 (SE +/- 1.65, N = 3)
  c2-standard-8 CLX: 150.92 (SE +/- 1.30, N = 3)
  n2-standard-8 CLX: 166.38 (SE +/- 0.66, N = 3)
  n2-highcpu-8 CLX: 167.35 (SE +/- 1.83, N = 3)
  t2d-standard-8 AMD: 81.81 (SE +/- 0.35, N = 3)
  c2d-highcpu-8 AMD: 128.17 (SE +/- 0.71, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Homogeneous Halfspace

Seconds, Fewer Is Better (SPECFEM3D 4.0)
  c3-highcpu-8 SPR: 179.55 (SE +/- 0.09, N = 3)
  c2-standard-8 CLX: 190.79 (SE +/- 1.87, N = 3)
  n2-standard-8 CLX: 213.89 (SE +/- 1.45, N = 3)
  n2-highcpu-8 CLX: 215.27 (SE +/- 2.61, N = 3)
  t2d-standard-8 AMD: 102.34 (SE +/- 0.29, N = 3)
  c2d-highcpu-8 AMD: 164.00 (SE +/- 0.71, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

SPECFEM3D

Model: Water-layered Halfspace

Seconds, Fewer Is Better (SPECFEM3D 4.0)
  c3-highcpu-8 SPR: 321.63 (SE +/- 0.31, N = 3)
  c2-standard-8 CLX: 347.81 (SE +/- 0.85, N = 3)
  n2-standard-8 CLX: 380.13 (SE +/- 1.21, N = 3)
  n2-highcpu-8 CLX: 374.19 (SE +/- 2.86, N = 3)
  t2d-standard-8 AMD: 197.52 (SE +/- 1.86, N = 12)
  c2d-highcpu-8 AMD: 287.42 (SE +/- 0.44, N = 3)
1. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi -lopen-rte -lopen-pal -lhwloc -levent_core -levent_pthreads -lm -lz

Zstd Compression

Compression Level: 19 - Compression Speed

MB/s, More Is Better (Zstd Compression 1.5.4)
  c3-highcpu-8 SPR: 10.30 (SE +/- 0.12, N = 3)
  c2-standard-8 CLX: 9.24 (SE +/- 0.08, N = 3)
  n2-standard-8 CLX: 8.24 (SE +/- 0.06, N = 3)
  n2-highcpu-8 CLX: 8.29 (SE +/- 0.08, N = 5)
  t2d-standard-8 AMD: 10.90 (SE +/- 0.12, N = 3)
  c2d-highcpu-8 AMD: 11.00 (SE +/- 0.06, N = 3)
Additional option shown in the export without a clear instance attribution: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19 - Decompression Speed

MB/s, More Is Better (Zstd Compression 1.5.4)
  c3-highcpu-8 SPR: 905.2 (SE +/- 1.50, N = 3)
  c2-standard-8 CLX: 701.1 (SE +/- 1.45, N = 3)
  n2-standard-8 CLX: 649.5 (SE +/- 3.67, N = 3)
  n2-highcpu-8 CLX: 640.8 (SE +/- 1.15, N = 5)
  t2d-standard-8 AMD: 1129.0 (SE +/- 4.02, N = 3)
  c2d-highcpu-8 AMD: 1253.6 (SE +/- 3.36, N = 3)
Additional option shown in the export without a clear instance attribution: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Compression Speed

MB/s, More Is Better (Zstd Compression 1.5.4)
  c3-highcpu-8 SPR: 6.50 (SE +/- 0.00, N = 3)
  c2-standard-8 CLX: 5.86 (SE +/- 0.00, N = 3)
  n2-standard-8 CLX: 5.13 (SE +/- 0.04, N = 15)
  n2-highcpu-8 CLX: 5.04 (SE +/- 0.00, N = 3)
  t2d-standard-8 AMD: 5.42 (SE +/- 0.06, N = 3)
  c2d-highcpu-8 AMD: 6.71 (SE +/- 0.04, N = 3)
Additional option shown in the export without a clear instance attribution: -llzma
1. (CC) gcc options: -O3 -pthread -lz

Zstd Compression

Compression Level: 19, Long Mode - Decompression Speed

MB/s, More Is Better (Zstd Compression 1.5.4)
  c3-highcpu-8 SPR: 907.2 (SE +/- 1.48, N = 3)
  c2-standard-8 CLX: 713.8 (SE +/- 2.02, N = 3)
  n2-standard-8 CLX: 660.3 (SE +/- 1.88, N = 15)
  n2-highcpu-8 CLX: 650.0 (SE +/- 0.09, N = 3)
  t2d-standard-8 AMD: 1075.8 (SE +/- 7.21, N = 3)
  c2d-highcpu-8 AMD: 1183.8 (SE +/- 9.51, N = 3)
Additional option shown in the export without a clear instance attribution: -llzma
1. (CC) gcc options: -O3 -pthread -lz

John The Ripper

Test: bcrypt

Real C/S, More Is Better (John The Ripper 2023.03.14)
  c3-highcpu-8 SPR: 6932 (SE +/- 0.67, N = 3)
  c2-standard-8 CLX: 6687 (SE +/- 0.67, N = 3)
  n2-standard-8 CLX: 5970 (SE +/- 0.67, N = 3)
  n2-highcpu-8 CLX: 5962 (SE +/- 1.45, N = 3)
  t2d-standard-8 AMD: 10585 (SE +/- 8.14, N = 3)
  c2d-highcpu-8 AMD: 7620 (SE +/- 1.00, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: Blowfish

Real C/S, More Is Better (John The Ripper 2023.03.14)
  c3-highcpu-8 SPR: 6930 (SE +/- 2.08, N = 3)
  c2-standard-8 CLX: 6684 (SE +/- 2.73, N = 3)
  n2-standard-8 CLX: 5968 (SE +/- 1.00, N = 3)
  n2-highcpu-8 CLX: 5959 (SE +/- 1.45, N = 3)
  t2d-standard-8 AMD: 10611 (SE +/- 13.28, N = 3)
  c2d-highcpu-8 AMD: 7619 (SE +/- 1.67, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

John The Ripper

Test: MD5

Real C/S, More Is Better (John The Ripper 2023.03.14)
  c3-highcpu-8 SPR: 765713 (SE +/- 1472.13, N = 3)
  c2-standard-8 CLX: 683546 (SE +/- 162.53, N = 3)
  n2-standard-8 CLX: 599637 (SE +/- 266.70, N = 3)
  n2-highcpu-8 CLX: 598971 (SE +/- 168.31, N = 3)
  t2d-standard-8 AMD: 738210 (SE +/- 385.43, N = 3)
  c2d-highcpu-8 AMD: 436585 (SE +/- 56.60, N = 3)
1. (CC) gcc options: -m64 -lssl -lcrypto -fopenmp -lm -lrt -lz -ldl -lcrypt

Embree

Binary: Pathtracer ISPC - Model: Crown

Frames Per Second, More Is Better (Embree 4.0.1)
  c3-highcpu-8 SPR: 5.8475 (SE +/- 0.0120, N = 3; MIN: 5.81 / MAX: 5.92)
  c2-standard-8 CLX: 3.9340 (SE +/- 0.0030, N = 3; MIN: 3.91 / MAX: 3.99)
  n2-standard-8 CLX: 3.4584 (SE +/- 0.0212, N = 3; MIN: 3.38 / MAX: 3.58)
  n2-highcpu-8 CLX: 3.4714 (SE +/- 0.0114, N = 3; MIN: 3.4 / MAX: 3.58)
  t2d-standard-8 AMD: 5.4928 (SE +/- 0.0419, N = 3; MIN: 5.31 / MAX: 5.65)
  c2d-highcpu-8 AMD: 4.9231 (SE +/- 0.0095, N = 3; MIN: 4.87 / MAX: 5.01)

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

Frames Per Second, More Is Better (Embree 4.0.1)
  c3-highcpu-8 SPR: 7.3642 (SE +/- 0.0079, N = 3; MIN: 7.33 / MAX: 7.44)
  c2-standard-8 CLX: 5.1542 (SE +/- 0.0119, N = 3; MIN: 5.12 / MAX: 5.22)
  n2-standard-8 CLX: 4.5768 (SE +/- 0.0155, N = 3; MIN: 4.49 / MAX: 4.73)
  n2-highcpu-8 CLX: 4.5994 (SE +/- 0.0079, N = 3; MIN: 4.54 / MAX: 4.72)
  t2d-standard-8 AMD: 6.4173 (SE +/- 0.0540, N = 3; MIN: 6.23 / MAX: 6.66)
  c2d-highcpu-8 AMD: 5.8350 (SE +/- 0.0077, N = 3; MIN: 5.78 / MAX: 5.94)

uvg266

Video Input: Bosphorus 4K - Video Preset: Very Fast

Frames Per Second, More Is Better (uvg266 0.4.1)
  c3-highcpu-8 SPR: 6.99 (SE +/- 0.03, N = 3)
  c2-standard-8 CLX: 5.69 (SE +/- 0.02, N = 3)
  n2-standard-8 CLX: 4.97 (SE +/- 0.03, N = 3)
  n2-highcpu-8 CLX: 4.93 (SE +/- 0.00, N = 3)
  t2d-standard-8 AMD: 10.68 (SE +/- 0.06, N = 3)
  c2d-highcpu-8 AMD: 7.90 (SE +/- 0.03, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Super Fast

Frames Per Second, More Is Better (uvg266 0.4.1)
  c3-highcpu-8 SPR: 7.48 (SE +/- 0.00, N = 3)
  c2-standard-8 CLX: 6.03 (SE +/- 0.00, N = 3)
  n2-standard-8 CLX: 5.28 (SE +/- 0.00, N = 3)
  n2-highcpu-8 CLX: 5.17 (SE +/- 0.00, N = 3)
  t2d-standard-8 AMD: 11.36 (SE +/- 0.03, N = 3)
  c2d-highcpu-8 AMD: 8.14 (SE +/- 0.01, N = 3)

uvg266

Video Input: Bosphorus 4K - Video Preset: Ultra Fast

Frames Per Second, More Is Better (uvg266 0.4.1)
  c3-highcpu-8 SPR: 9.12 (SE +/- 0.01, N = 3)
  c2-standard-8 CLX: 7.45 (SE +/- 0.01, N = 3)
  n2-standard-8 CLX: 6.51 (SE +/- 0.01, N = 3)
  n2-highcpu-8 CLX: 6.41 (SE +/- 0.00, N = 3)
  t2d-standard-8 AMD: 13.67 (SE +/- 0.07, N = 3)
  c2d-highcpu-8 AMD: 9.82 (SE +/- 0.01, N = 3)

uvg266

Video Input: Bosphorus 1080p - Video Preset: Very Fast

Frames Per Second, More Is Better (uvg266 0.4.1)
  c3-highcpu-8 SPR: 32.39 (SE +/- 0.25, N = 3)
  c2-standard-8 CLX: 26.30 (SE +/- 0.16, N = 3)
  n2-standard-8 CLX: 22.81 (SE +/- 0.15, N = 3)
  n2-highcpu-8 CLX: 22.87 (SE +/- 0.12, N = 3)
  t2d-standard-8 AMD: 49.19 (SE +/- 0.51, N = 3)
  c2d-highcpu-8 AMD: 36.45 (SE +/- 0.37, N = 3)

uvg266

Video Input: Bosphorus 1080p - Video Preset: Super Fast

Frames Per Second, More Is Better (uvg266 0.4.1)
  c3-highcpu-8 SPR: 34.50 (SE +/- 0.03, N = 3)
  c2-standard-8 CLX: 27.72 (SE +/- 0.01, N = 3)
  n2-standard-8 CLX: 24.15 (SE +/- 0.01, N = 3)
  n2-highcpu-8 CLX: 24.15 (SE +/- 0.02, N = 3)
  t2d-standard-8 AMD: 52.25 (SE +/- 0.12, N = 3)
  c2d-highcpu-8 AMD: 37.79 (SE +/- 0.04, N = 3)

uvg266

Video Input: Bosphorus 1080p - Video Preset: Ultra Fast

Frames Per Second, More Is Better (uvg266 0.4.1)
  c3-highcpu-8 SPR: 42.24 (SE +/- 0.01, N = 3)
  c2-standard-8 CLX: 34.47 (SE +/- 0.01, N = 3)
  n2-standard-8 CLX: 30.05 (SE +/- 0.05, N = 3)
  n2-highcpu-8 CLX: 29.97 (SE +/- 0.03, N = 3)
  t2d-standard-8 AMD: 61.85 (SE +/- 0.33, N = 3)
  c2d-highcpu-8 AMD: 45.53 (SE +/- 0.08, N = 3)

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160

Images / Sec, More Is Better (Intel Open Image Denoise 1.4.0)
  c3-highcpu-8 SPR: 0.24 (SE +/- 0.00, N = 3)
  c2-standard-8 CLX: 0.22 (SE +/- 0.00, N = 3)
  n2-standard-8 CLX: 0.20 (SE +/- 0.00, N = 3)
  n2-highcpu-8 CLX: 0.20 (SE +/- 0.00, N = 3)
  t2d-standard-8 AMD: 0.30 (SE +/- 0.00, N = 3)
  c2d-highcpu-8 AMD: 0.16 (SE +/- 0.00, N = 3)

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096

Images / Sec, More Is Better (Intel Open Image Denoise 1.4.0)
  c3-highcpu-8 SPR: 0.12 (SE +/- 0.00, N = 3)
  c2-standard-8 CLX: 0.11 (SE +/- 0.00, N = 3)
  n2-standard-8 CLX: 0.10 (SE +/- 0.00, N = 3)
  n2-highcpu-8 CLX: 0.09 (SE +/- 0.00, N = 3)
  t2d-standard-8 AMD: 0.15 (SE +/- 0.00, N = 3)
  c2d-highcpu-8 AMD: 0.08 (SE +/- 0.00, N = 3)

OpenVKL

Benchmark: vklBenchmark ISPC

Items / Sec, More Is Better (OpenVKL 1.3.1)
  c3-highcpu-8 SPR: 98 (SE +/- 0.33, N = 3; MIN: 11 / MAX: 1579)
  c2-standard-8 CLX: 70 (SE +/- 0.00, N = 3; MIN: 8 / MAX: 1119)
  n2-standard-8 CLX: 63 (SE +/- 0.33, N = 3; MIN: 7 / MAX: 1024)
  n2-highcpu-8 CLX: 81 (SE +/- 0.33, N = 3; MIN: 8 / MAX: 1030)
  t2d-standard-8 AMD: 75 (SE +/- 0.58, N = 3; MIN: 9 / MAX: 997)
  c2d-highcpu-8 AMD: 62 (SE +/- 0.58, N = 3; MIN: 7 / MAX: 923)

7-Zip Compression

Test: Compression Rating

MIPS, More Is Better (7-Zip Compression 22.01)
  c3-highcpu-8 SPR: 35306 (SE +/- 246.17, N = 15)
  c2-standard-8 CLX: 30989 (SE +/- 248.15, N = 3)
  n2-standard-8 CLX: 28062 (SE +/- 125.77, N = 3)
  n2-highcpu-8 CLX: 28093 (SE +/- 324.52, N = 4)
  t2d-standard-8 AMD: 48449 (SE +/- 126.17, N = 3)
  c2d-highcpu-8 AMD: 39203 (SE +/- 371.22, N = 3)
1. (CXX) g++ options: -lpthread -ldl -O2 -fPIC
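The "SE +/-" and "N" figures can be turned into an approximate run-to-run standard deviation. The sketch below assumes SE is the usual standard error of the mean (standard deviation divided by the square root of N); the export itself does not spell out the formula, so treat the result as an estimate. The function name is hypothetical.

```python
import math

def approx_std_dev(standard_error: float, runs: int) -> float:
    """Estimate the sample standard deviation, assuming SE = SD / sqrt(N)."""
    return standard_error * math.sqrt(runs)

# 7-Zip Compression Rating on c3-highcpu-8 SPR: 35306 MIPS, SE +/- 246.17, N = 15.
mean_mips = 35306
sd_mips = approx_std_dev(246.17, 15)

# Spread relative to the mean, as a percentage.
relative_spread_pct = 100.0 * sd_mips / mean_mips

print(f"approx. SD: {sd_mips:.2f} MIPS ({relative_spread_pct:.2f}% of the mean)")
```

A larger N in these charts (15 here against the usual 3) means more runs were recorded for that instance, so SE values are not directly comparable across instances without this kind of rescaling.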

Timed FFmpeg Compilation

Time To Compile

Seconds, Fewer Is Better (Timed FFmpeg Compilation 6.0)
  c3-highcpu-8 SPR: 120.44 (SE +/- 0.10, N = 3)
  c2-standard-8 CLX: 139.04 (SE +/- 0.03, N = 3)
  n2-standard-8 CLX: 154.90 (SE +/- 1.17, N = 3)
  n2-highcpu-8 CLX: 154.90 (SE +/- 0.08, N = 3)
  t2d-standard-8 AMD: 80.65 (SE +/- 0.17, N = 3)
  c2d-highcpu-8 AMD: 119.64 (SE +/- 0.01, N = 3)

Timed Linux Kernel Compilation

Build: defconfig

Seconds, Fewer Is Better (Timed Linux Kernel Compilation 6.1)
  c3-highcpu-8 SPR: 244.80 (SE +/- 0.64, N = 3)
  c2-standard-8 CLX: 289.66 (SE +/- 0.85, N = 3)
  n2-standard-8 CLX: 322.38 (SE +/- 1.15, N = 3)
  n2-highcpu-8 CLX: 327.76 (SE +/- 0.77, N = 3)
  t2d-standard-8 AMD: 184.68 (SE +/- 0.35, N = 3)
  c2d-highcpu-8 AMD: 259.34 (SE +/- 0.57, N = 3)

oneDNN

Harness: IP Shapes 1D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 1.50004 (SE +/- 0.00340, N = 3)
c2-standard-8 CLX: 19.52870 (SE +/- 0.01518, N = 3; MIN: 18.98)
n2-standard-8 CLX: 22.02520 (SE +/- 0.02550, N = 3; MIN: 21.41)
n2-highcpu-8 CLX: 22.03560 (SE +/- 0.02769, N = 3; MIN: 21.37)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: IP Shapes 3D - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 5.34218 (SE +/- 0.00535, N = 3; MIN: 4.94)
c2-standard-8 CLX: 8.02416 (SE +/- 0.01190, N = 3; MIN: 7.86)
n2-standard-8 CLX: 8.51864 (SE +/- 0.02441, N = 3; MIN: 8.13)
n2-highcpu-8 CLX: 8.62637 (SE +/- 0.03681, N = 3; MIN: 8.15)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Convolution Batch Shapes Auto - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 4.17707 (SE +/- 0.00870, N = 3)
c2-standard-8 CLX: 34.18930 (SE +/- 0.00489, N = 3; MIN: 33.77)
n2-standard-8 CLX: 38.87170 (SE +/- 0.03521, N = 3; MIN: 38.45)
n2-highcpu-8 CLX: 38.96760 (SE +/- 0.00800, N = 3; MIN: 38.58)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_1d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 1.47145 (SE +/- 0.00472, N = 3)
c2-standard-8 CLX: 52.94210 (SE +/- 0.00209, N = 3; MIN: 52.7)
n2-standard-8 CLX: 60.59250 (SE +/- 0.04947, N = 3; MIN: 60.11)
n2-highcpu-8 CLX: 60.55490 (SE +/- 0.01683, N = 3; MIN: 60.16)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Deconvolution Batch shapes_3d - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 3.54944 (SE +/- 0.01072, N = 3)
c2-standard-8 CLX: 35.91640 (SE +/- 0.01009, N = 3; MIN: 35.71)
n2-standard-8 CLX: 40.65420 (SE +/- 0.03334, N = 3; MIN: 40.35)
n2-highcpu-8 CLX: 40.77870 (SE +/- 0.03843, N = 3; MIN: 40.38)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Training - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 4660.70 (SE +/- 0.86, N = 3; MIN: 4648.25)
c2-standard-8 CLX: 5767.70 (SE +/- 8.38, N = 3; MIN: 5732.12)
n2-standard-8 CLX: 6078.81 (SE +/- 2.66, N = 3; MIN: 6032.59)
n2-highcpu-8 CLX: 6112.65 (SE +/- 9.18, N = 3; MIN: 6059.51)
t2d-standard-8 AMD: 4004.68 (SE +/- 17.70, N = 3; MIN: 3904.56)
c2d-highcpu-8 AMD: 6503.63 (SE +/- 4.35, N = 3; MIN: 6453.16)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Recurrent Neural Network Inference - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 2337.31 (SE +/- 1.76, N = 3; MIN: 2326.77)
c2-standard-8 CLX: 2998.03 (SE +/- 0.83, N = 3; MIN: 2977.98)
n2-standard-8 CLX: 3172.34 (SE +/- 31.87, N = 3; MIN: 3085.07)
n2-highcpu-8 CLX: 3157.38 (SE +/- 3.96, N = 3; MIN: 3110.9)
t2d-standard-8 AMD: 2107.39 (SE +/- 7.71, N = 3; MIN: 2012.96)
c2d-highcpu-8 AMD: 3335.70 (SE +/- 3.60, N = 3; MIN: 3294.12)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

oneDNN

Harness: Matrix Multiply Batch Shapes Transformer - Data Type: bf16bf16bf16 - Engine: CPU

oneDNN 3.0 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 0.968986 (SE +/- 0.013114, N = 3)
c2-standard-8 CLX: 7.432600 (SE +/- 0.007542, N = 3; MIN: 7.25)
n2-standard-8 CLX: 8.506780 (SE +/- 0.005782, N = 3; MIN: 8.27)
n2-highcpu-8 CLX: 8.445150 (SE +/- 0.006708, N = 3; MIN: 8.22)
(CXX) g++ options: -O3 -march=native -fopenmp -msse4.1 -fPIC -pie -ldl

OSPRay Studio

Camera: 1 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OSPRay Studio 0.11 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 20952 (SE +/- 4.91, N = 3)
c2-standard-8 CLX: 30911 (SE +/- 32.33, N = 3)
n2-standard-8 CLX: 35884 (SE +/- 166.18, N = 3)
n2-highcpu-8 CLX: 35193 (SE +/- 93.88, N = 3)
t2d-standard-8 AMD: 25306 (SE +/- 93.60, N = 3)
c2d-highcpu-8 AMD: 31370 (SE +/- 54.89, N = 3)
(CXX) g++ options: -O3 -lm -ldl

OSPRay Studio

Camera: 3 - Resolution: 4K - Samples Per Pixel: 1 - Renderer: Path Tracer

OSPRay Studio 0.11 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 25819 (SE +/- 364.36, N = 3)
c2-standard-8 CLX: 37892 (SE +/- 39.94, N = 3)
n2-standard-8 CLX: 61337 (SE +/- 134.74, N = 3)
n2-highcpu-8 CLX: 61150 (SE +/- 131.72, N = 3)
t2d-standard-8 AMD: 30299 (SE +/- 224.15, N = 3)
c2d-highcpu-8 AMD: 37227 (SE +/- 47.59, N = 3)
(CXX) g++ options: -O3 -lm -ldl

OpenSSL

Algorithm: SHA256

OpenSSL 3.1 (byte/s, More Is Better)
c3-highcpu-8 SPR: 4283873987 (SE +/- 2722265.85, N = 3)
c2-standard-8 CLX: 1318193530 (SE +/- 28303.98, N = 3)
n2-standard-8 CLX: 1178513873 (SE +/- 1532515.57, N = 3)
n2-highcpu-8 CLX: 1177333613 (SE +/- 191767.32, N = 3)
t2d-standard-8 AMD: 6876127730 (SE +/- 1526902.57, N = 3)
c2d-highcpu-8 AMD: 5824179883 (SE +/- 2236930.48, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenSSL 3.1 (byte/s, More Is Better)
c3-highcpu-8 SPR: 1568572920 (SE +/- 1152941.98, N = 3)
c2-standard-8 CLX: 1465815227 (SE +/- 2965153.70, N = 3)
n2-standard-8 CLX: 1308154027 (SE +/- 3181911.70, N = 3)
n2-highcpu-8 CLX: 1306223720 (SE +/- 2213714.99, N = 3)
t2d-standard-8 AMD: 3158654817 (SE +/- 1798677.56, N = 3)
c2d-highcpu-8 AMD: 1863352067 (SE +/- 256893.72, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenSSL 3.1 (sign/s, More Is Better)
c3-highcpu-8 SPR: 2062.7 (SE +/- 1.17, N = 3)
c2-standard-8 CLX: 1156.6 (SE +/- 2.05, N = 3)
n2-standard-8 CLX: 1028.6 (SE +/- 2.24, N = 3)
n2-highcpu-8 CLX: 1028.8 (SE +/- 3.47, N = 3)
t2d-standard-8 AMD: 1765.0 (SE +/- 0.38, N = 3)
c2d-highcpu-8 AMD: 1025.7 (SE +/- 0.28, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenSSL 3.1 (byte/s, More Is Better)
c3-highcpu-8 SPR: 22091557637 (SE +/- 35781789.46, N = 3)
c2-standard-8 CLX: 21346811813 (SE +/- 1497684.67, N = 3)
n2-standard-8 CLX: 19055130217 (SE +/- 15025960.55, N = 3)
n2-highcpu-8 CLX: 19065485903 (SE +/- 2986147.97, N = 3)
t2d-standard-8 AMD: 26325286910 (SE +/- 5497165.35, N = 3)
c2d-highcpu-8 AMD: 17266517790 (SE +/- 9826068.49, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-128-GCM

OpenSSL 3.1 (byte/s, More Is Better)
c3-highcpu-8 SPR: 57594077823 (SE +/- 33921495.47, N = 3)
c2-standard-8 CLX: 23237312603 (SE +/- 6165180.91, N = 3)
n2-standard-8 CLX: 20765256453 (SE +/- 1982730.14, N = 3)
n2-highcpu-8 CLX: 20753550177 (SE +/- 2887046.99, N = 3)
t2d-standard-8 AMD: 33948315373 (SE +/- 13818265.47, N = 3)
c2d-highcpu-8 AMD: 18522423023 (SE +/- 281398.00, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenSSL 3.1 (byte/s, More Is Better)
c3-highcpu-8 SPR: 48008361573 (SE +/- 48074910.93, N = 3)
c2-standard-8 CLX: 16945000630 (SE +/- 3855504.36, N = 3)
n2-standard-8 CLX: 15129238003 (SE +/- 5869541.98, N = 3)
n2-highcpu-8 CLX: 15133854923 (SE +/- 6267687.89, N = 3)
t2d-standard-8 AMD: 31447472720 (SE +/- 31574149.68, N = 3)
c2d-highcpu-8 AMD: 16968938653 (SE +/- 2631212.92, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenSSL 3.1 (byte/s, More Is Better)
c3-highcpu-8 SPR: 15970781140 (SE +/- 21990039.72, N = 3)
c2-standard-8 CLX: 10919151843 (SE +/- 1592104.52, N = 3)
n2-standard-8 CLX: 9576513353 (SE +/- 2582153.15, N = 3)
n2-highcpu-8 CLX: 9567474300 (SE +/- 3854876.66, N = 3)
t2d-standard-8 AMD: 17592600077 (SE +/- 35085963.78, N = 3)
c2d-highcpu-8 AMD: 11759326457 (SE +/- 378013.69, N = 3)
(CC) gcc options: -pthread -m64 -O3 -lssl -lcrypto -ldl

CockroachDB

Workload: KV, 50% Reads - Concurrency: 128

CockroachDB 22.2 (ops/s, More Is Better)
c3-highcpu-8 SPR: 19321.6 (SE +/- 40.86, N = 3)
c2-standard-8 CLX: 13684.5 (SE +/- 68.65, N = 3)
n2-standard-8 CLX: 12699.1 (SE +/- 70.13, N = 3)
n2-highcpu-8 CLX: 12635.1 (SE +/- 40.52, N = 3)
t2d-standard-8 AMD: 19110.4 (SE +/- 5.78, N = 3)
c2d-highcpu-8 AMD: 17168.4 (SE +/- 73.34, N = 3)

CockroachDB

Workload: KV, 95% Reads - Concurrency: 128

CockroachDB 22.2 (ops/s, More Is Better)
c3-highcpu-8 SPR: 24960.1 (SE +/- 127.78, N = 3)
c2-standard-8 CLX: 16184.2 (SE +/- 79.71, N = 3)
n2-standard-8 CLX: 14976.8 (SE +/- 39.57, N = 3)
n2-highcpu-8 CLX: 14884.7 (SE +/- 99.60, N = 3)
t2d-standard-8 AMD: 22148.8 (SE +/- 53.72, N = 3)
c2d-highcpu-8 AMD: 20571.8 (SE +/- 44.80, N = 3)

Memcached

Set To Get Ratio: 1:10

Memcached 1.6.18 (Ops/sec, More Is Better)
c3-highcpu-8 SPR: 1044947.13 (SE +/- 3280.78, N = 3)
c2-standard-8 CLX: 715723.53 (SE +/- 2213.84, N = 3)
n2-standard-8 CLX: 641713.60 (SE +/- 3190.98, N = 3)
n2-highcpu-8 CLX: 614742.62 (SE +/- 2135.86, N = 3)
t2d-standard-8 AMD: 1200527.42 (SE +/- 10218.36, N = 3)
c2d-highcpu-8 AMD: 842830.77 (SE +/- 5221.50, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Memcached

Set To Get Ratio: 1:100

Memcached 1.6.18 (Ops/sec, More Is Better)
c3-highcpu-8 SPR: 1030937.28 (SE +/- 11157.73, N = 3)
c2-standard-8 CLX: 702291.93 (SE +/- 3476.80, N = 3)
n2-standard-8 CLX: 624980.66 (SE +/- 4342.69, N = 3)
n2-highcpu-8 CLX: 604363.37 (SE +/- 1810.97, N = 3)
t2d-standard-8 AMD: 1224550.68 (SE +/- 15912.40, N = 3)
c2d-highcpu-8 AMD: 835405.06 (SE +/- 2694.56, N = 3)
(CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

GROMACS 2023 (Ns Per Day, More Is Better)
c3-highcpu-8 SPR: 0.777 (SE +/- 0.001, N = 3)
c2-standard-8 CLX: 0.579 (SE +/- 0.001, N = 3)
n2-standard-8 CLX: 0.528 (SE +/- 0.001, N = 3)
n2-highcpu-8 CLX: 0.529 (SE +/- 0.001, N = 3)
t2d-standard-8 AMD: 1.098 (SE +/- 0.009, N = 3)
c2d-highcpu-8 AMD: 0.690 (SE +/- 0.001, N = 3)
(CXX) g++ options: -O3

MariaDB

Clients: 2048

MariaDB 11.0.1 (Queries Per Second, More Is Better)
c3-highcpu-8 SPR: 332 (SE +/- 3.01, N = 3)
c2-standard-8 CLX: 248 (SE +/- 3.50, N = 3)
n2-standard-8 CLX: 220 (SE +/- 2.04, N = 7)
n2-highcpu-8 CLX: 220 (SE +/- 1.89, N = 3)
t2d-standard-8 AMD: 301 (SE +/- 7.31, N = 6)
c2d-highcpu-8 AMD: 305 (SE +/- 2.57, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

MariaDB

Clients: 4096

MariaDB 11.0.1 (Queries Per Second, More Is Better)
c3-highcpu-8 SPR: 317 (SE +/- 2.62, N = 3)
c2-standard-8 CLX: 237 (SE +/- 2.55, N = 3)
n2-standard-8 CLX: 207 (SE +/- 2.13, N = 3)
n2-highcpu-8 CLX: 214 (SE +/- 1.01, N = 3)
t2d-standard-8 AMD: 280 (SE +/- 2.99, N = 4)
c2d-highcpu-8 AMD: 292 (SE +/- 1.97, N = 3)
(CXX) g++ options: -pie -fPIC -fstack-protector -O3 -lnuma -lcrypt -lz -lm -lssl -lcrypto -lpthread -ldl

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only

PostgreSQL 15 (TPS, More Is Better)
c3-highcpu-8 SPR: 311942 (SE +/- 3369.27, N = 3)
c2-standard-8 CLX: 173317 (SE +/- 822.93, N = 3)
n2-standard-8 CLX: 158267 (SE +/- 259.80, N = 3)
n2-highcpu-8 CLX: 152093 (SE +/- 2195.04, N = 12)
t2d-standard-8 AMD: 323353 (SE +/- 4510.85, N = 3)
c2d-highcpu-8 AMD: 249490 (SE +/- 664.60, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 800 - Mode: Read Only - Average Latency

PostgreSQL 15 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 2.565 (SE +/- 0.028, N = 3)
c2-standard-8 CLX: 4.616 (SE +/- 0.022, N = 3)
n2-standard-8 CLX: 5.055 (SE +/- 0.008, N = 3)
n2-highcpu-8 CLX: 5.274 (SE +/- 0.088, N = 12)
t2d-standard-8 AMD: 2.475 (SE +/- 0.035, N = 3)
c2d-highcpu-8 AMD: 3.206 (SE +/- 0.009, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only

PostgreSQL 15 (TPS, More Is Better)
c3-highcpu-8 SPR: 293725 (SE +/- 2414.33, N = 3)
c2-standard-8 CLX: 169643 (SE +/- 1014.66, N = 3)
n2-standard-8 CLX: 155196 (SE +/- 1490.26, N = 12)
n2-highcpu-8 CLX: 145790 (SE +/- 2071.56, N = 3)
t2d-standard-8 AMD: 320827 (SE +/- 2738.70, N = 3)
c2d-highcpu-8 AMD: 244198 (SE +/- 2132.82, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

PostgreSQL

Scaling Factor: 100 - Clients: 1000 - Mode: Read Only - Average Latency

PostgreSQL 15 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 3.405 (SE +/- 0.028, N = 3)
c2-standard-8 CLX: 5.895 (SE +/- 0.035, N = 3)
n2-standard-8 CLX: 6.450 (SE +/- 0.066, N = 12)
n2-highcpu-8 CLX: 6.862 (SE +/- 0.097, N = 3)
t2d-standard-8 AMD: 3.118 (SE +/- 0.026, N = 3)
c2d-highcpu-8 AMD: 4.096 (SE +/- 0.036, N = 3)
(CC) gcc options: -fno-strict-aliasing -fwrapv -O2 -lpgcommon -lpgport -lpq -lm

TensorFlow

Device: CPU - Batch Size: 16 - Model: ResNet-50

TensorFlow 2.10 (images/sec, More Is Better)
c3-highcpu-8 SPR: 14.20 (SE +/- 0.04, N = 3)
c2-standard-8 CLX: 13.33 (SE +/- 0.02, N = 3)
n2-standard-8 CLX: 12.18 (SE +/- 0.08, N = 3)
n2-highcpu-8 CLX: 12.18 (SE +/- 0.02, N = 3)
t2d-standard-8 AMD: 10.74 (SE +/- 0.04, N = 3)
c2d-highcpu-8 AMD: 8.66 (SE +/- 0.02, N = 3)

TensorFlow

Device: CPU - Batch Size: 32 - Model: ResNet-50

TensorFlow 2.10 (images/sec, More Is Better)
c3-highcpu-8 SPR: 14.93 (SE +/- 0.01, N = 3)
c2-standard-8 CLX: 14.11 (SE +/- 0.01, N = 3)
n2-standard-8 CLX: 12.95 (SE +/- 0.07, N = 3)
n2-highcpu-8 CLX: 12.99 (SE +/- 0.01, N = 3)
t2d-standard-8 AMD: 10.61 (SE +/- 0.05, N = 3)
c2d-highcpu-8 AMD: 8.55 (SE +/- 0.01, N = 3)

TensorFlow

Device: CPU - Batch Size: 64 - Model: ResNet-50

TensorFlow 2.10 (images/sec, More Is Better)
c3-highcpu-8 SPR: 15.69 (SE +/- 0.01, N = 3)
c2-standard-8 CLX: 14.79 (SE +/- 0.18, N = 3)
n2-standard-8 CLX: 13.74 (SE +/- 0.01, N = 3)
n2-highcpu-8 CLX: 13.70 (SE +/- 0.01, N = 3)
t2d-standard-8 AMD: 10.48 (SE +/- 0.04, N = 3)
c2d-highcpu-8 AMD: 8.46 (SE +/- 0.00, N = 3)

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 3.7873 (SE +/- 0.0130, N = 3)
c2-standard-8 CLX: 2.9308 (SE +/- 0.0001, N = 3)
n2-standard-8 CLX: 2.6587 (SE +/- 0.0040, N = 3)
n2-highcpu-8 CLX: 2.6268 (SE +/- 0.0005, N = 3)
t2d-standard-8 AMD: 6.1611 (SE +/- 0.0107, N = 3)
c2d-highcpu-8 AMD: 3.4194 (SE +/- 0.0030, N = 3)

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 528.02 (SE +/- 1.78, N = 3)
c2-standard-8 CLX: 682.37 (SE +/- 0.03, N = 3)
n2-standard-8 CLX: 752.19 (SE +/- 1.14, N = 3)
n2-highcpu-8 CLX: 761.34 (SE +/- 0.15, N = 3)
t2d-standard-8 AMD: 645.42 (SE +/- 0.74, N = 3)
c2d-highcpu-8 AMD: 584.56 (SE +/- 0.39, N = 3)

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 19.17 (SE +/- 0.03, N = 3)
c2-standard-8 CLX: 15.92 (SE +/- 0.05, N = 3)
n2-standard-8 CLX: 15.36 (SE +/- 0.06, N = 3)
n2-highcpu-8 CLX: 15.11 (SE +/- 0.02, N = 3)
t2d-standard-8 AMD: 23.81 (SE +/- 0.05, N = 3)
c2d-highcpu-8 AMD: 13.55 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 104.31 (SE +/- 0.14, N = 3)
c2-standard-8 CLX: 125.47 (SE +/- 0.40, N = 3)
n2-standard-8 CLX: 130.02 (SE +/- 0.52, N = 3)
n2-highcpu-8 CLX: 132.22 (SE +/- 0.19, N = 3)
t2d-standard-8 AMD: 167.67 (SE +/- 0.31, N = 3)
c2d-highcpu-8 AMD: 147.48 (SE +/- 0.37, N = 3)

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 64.87 (SE +/- 0.04, N = 3)
c2-standard-8 CLX: 59.99 (SE +/- 0.02, N = 3)
n2-standard-8 CLX: 54.96 (SE +/- 0.02, N = 3)
n2-highcpu-8 CLX: 54.52 (SE +/- 0.07, N = 3)
t2d-standard-8 AMD: 68.86 (SE +/- 0.10, N = 3)
c2d-highcpu-8 AMD: 38.12 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 30.80 (SE +/- 0.02, N = 3)
c2-standard-8 CLX: 33.29 (SE +/- 0.01, N = 3)
n2-standard-8 CLX: 36.34 (SE +/- 0.02, N = 3)
n2-highcpu-8 CLX: 36.64 (SE +/- 0.04, N = 3)
t2d-standard-8 AMD: 57.99 (SE +/- 0.08, N = 3)
c2d-highcpu-8 AMD: 52.42 (SE +/- 0.04, N = 3)

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 33.11 (SE +/- 0.15, N = 3)
c2-standard-8 CLX: 29.18 (SE +/- 0.03, N = 3)
n2-standard-8 CLX: 25.97 (SE +/- 0.12, N = 3)
n2-highcpu-8 CLX: 25.69 (SE +/- 0.20, N = 3)
t2d-standard-8 AMD: 51.35 (SE +/- 0.02, N = 3)
c2d-highcpu-8 AMD: 28.49 (SE +/- 0.03, N = 3)

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 60.39 (SE +/- 0.28, N = 3)
c2-standard-8 CLX: 68.51 (SE +/- 0.08, N = 3)
n2-standard-8 CLX: 76.97 (SE +/- 0.37, N = 3)
n2-highcpu-8 CLX: 77.81 (SE +/- 0.60, N = 3)
t2d-standard-8 AMD: 77.78 (SE +/- 0.02, N = 3)
c2d-highcpu-8 AMD: 70.13 (SE +/- 0.07, N = 3)

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 6.5372 (SE +/- 0.0078, N = 3)
c2-standard-8 CLX: 6.1893 (SE +/- 0.0034, N = 3)
n2-standard-8 CLX: 5.7491 (SE +/- 0.0137, N = 3)
n2-highcpu-8 CLX: 5.6481 (SE +/- 0.0036, N = 3)
t2d-standard-8 AMD: 8.1654 (SE +/- 0.0795, N = 3)
c2d-highcpu-8 AMD: 4.5842 (SE +/- 0.0013, N = 3)

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 305.90 (SE +/- 0.37, N = 3)
c2-standard-8 CLX: 323.08 (SE +/- 0.18, N = 3)
n2-standard-8 CLX: 347.83 (SE +/- 0.83, N = 3)
n2-highcpu-8 CLX: 354.05 (SE +/- 0.22, N = 3)
t2d-standard-8 AMD: 489.08 (SE +/- 4.26, N = 3)
c2d-highcpu-8 AMD: 436.25 (SE +/- 0.12, N = 3)

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 16.22 (SE +/- 0.13, N = 3)
c2-standard-8 CLX: 13.99 (SE +/- 0.03, N = 3)
n2-standard-8 CLX: 12.74 (SE +/- 0.07, N = 3)
n2-highcpu-8 CLX: 12.88 (SE +/- 0.01, N = 3)
t2d-standard-8 AMD: 26.06 (SE +/- 0.06, N = 3)
c2d-highcpu-8 AMD: 14.35 (SE +/- 0.01, N = 3)

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 123.29 (SE +/- 0.95, N = 3)
c2-standard-8 CLX: 142.89 (SE +/- 0.29, N = 3)
n2-standard-8 CLX: 156.90 (SE +/- 0.88, N = 3)
n2-highcpu-8 CLX: 155.23 (SE +/- 0.06, N = 3)
t2d-standard-8 AMD: 153.16 (SE +/- 0.34, N = 3)
c2d-highcpu-8 AMD: 139.20 (SE +/- 0.08, N = 3)

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (items/sec, More Is Better)
c3-highcpu-8 SPR: 3.7693 (SE +/- 0.0239, N = 3)
c2-standard-8 CLX: 2.9352 (SE +/- 0.0038, N = 3)
n2-standard-8 CLX: 2.6597 (SE +/- 0.0025, N = 3)
n2-highcpu-8 CLX: 2.6570 (SE +/- 0.0045, N = 3)
t2d-standard-8 AMD: 6.1956 (SE +/- 0.0070, N = 3)
c2d-highcpu-8 AMD: 3.3909 (SE +/- 0.0121, N = 3)

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

Neural Magic DeepSparse 1.3.2 (ms/batch, Fewer Is Better)
c3-highcpu-8 SPR: 530.61 (SE +/- 3.34, N = 3)
c2-standard-8 CLX: 681.34 (SE +/- 0.89, N = 3)
n2-standard-8 CLX: 751.94 (SE +/- 0.71, N = 3)
n2-highcpu-8 CLX: 752.68 (SE +/- 1.28, N = 3)
t2d-standard-8 AMD: 642.98 (SE +/- 0.40, N = 3)
c2d-highcpu-8 AMD: 588.61 (SE +/- 2.04, N = 3)

Google Draco

Model: Lion

Google Draco 1.5.6 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 6250 (SE +/- 80.23, N = 15)
c2-standard-8 CLX: 7437 (SE +/- 18.52, N = 3)
n2-standard-8 CLX: 8355 (SE +/- 75.43, N = 3)
n2-highcpu-8 CLX: 8391 (SE +/- 21.31, N = 3)
t2d-standard-8 AMD: 6073 (SE +/- 32.94, N = 3)
c2d-highcpu-8 AMD: 5617 (SE +/- 18.18, N = 3)
(CXX) g++ options: -O3

Google Draco

Model: Church Facade

Google Draco 1.5.6 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 7573 (SE +/- 10.48, N = 3)
c2-standard-8 CLX: 11462 (SE +/- 16.76, N = 3)
n2-standard-8 CLX: 12332 (SE +/- 21.39, N = 3)
n2-highcpu-8 CLX: 12882 (SE +/- 141.00, N = 4)
t2d-standard-8 AMD: 8754 (SE +/- 40.60, N = 3)
c2d-highcpu-8 AMD: 7442 (SE +/- 34.49, N = 3)
(CXX) g++ options: -O3

Blender

Blend File: BMW27 - Compute: CPU-Only

Blender 3.4 (Seconds, Fewer Is Better)
c3-highcpu-8 SPR: 315.24 (SE +/- 1.13, N = 3)

nginx

Connections: 100

nginx 1.23.2 (Requests Per Second, More Is Better)
c3-highcpu-8 SPR: 36310.35 (SE +/- 23.95, N = 3)
c2-standard-8 CLX: 25148.28 (SE +/- 33.32, N = 3)
n2-standard-8 CLX: 22830.19 (SE +/- 51.85, N = 3)
n2-highcpu-8 CLX: 22444.40 (SE +/- 51.88, N = 3)
t2d-standard-8 AMD: 35257.77 (SE +/- 123.11, N = 3)
c2d-highcpu-8 AMD: 25019.86 (SE +/- 22.78, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 200

nginx 1.23.2 (Requests Per Second, More Is Better)
c3-highcpu-8 SPR: 35602.10 (SE +/- 83.52, N = 3)
c2-standard-8 CLX: 24695.91 (SE +/- 37.62, N = 3)
n2-standard-8 CLX: 22570.18 (SE +/- 116.88, N = 3)
n2-highcpu-8 CLX: 21431.42 (SE +/- 136.11, N = 3)
t2d-standard-8 AMD: 35225.04 (SE +/- 259.52, N = 3)
c2d-highcpu-8 AMD: 24677.03 (SE +/- 298.83, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 500

nginx 1.23.2 (Requests Per Second, More Is Better)
c3-highcpu-8 SPR: 34672.65 (SE +/- 321.85, N = 3)
c2-standard-8 CLX: 21957.17 (SE +/- 17.44, N = 3)
n2-standard-8 CLX: 20146.10 (SE +/- 51.39, N = 3)
n2-highcpu-8 CLX: 19806.60 (SE +/- 34.93, N = 3)
t2d-standard-8 AMD: 35127.26 (SE +/- 56.19, N = 3)
c2d-highcpu-8 AMD: 24265.31 (SE +/- 81.15, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 1000

nginx 1.23.2 (Requests Per Second, More Is Better)
c3-highcpu-8 SPR: 32118.58 (SE +/- 22.42, N = 3)
c2-standard-8 CLX: 21446.27 (SE +/- 84.22, N = 3)
n2-standard-8 CLX: 19683.60 (SE +/- 19.22, N = 3)
n2-highcpu-8 CLX: 19418.08 (SE +/- 102.00, N = 3)
t2d-standard-8 AMD: 34079.62 (SE +/- 47.86, N = 3)
c2d-highcpu-8 AMD: 23758.47 (SE +/- 10.88, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

nginx

Connections: 4000

nginx 1.23.2 (Requests Per Second, More Is Better)
c3-highcpu-8 SPR: 32814.75 (SE +/- 27.93, N = 3)
c2-standard-8 CLX: 21594.94 (SE +/- 11.52, N = 3)
n2-standard-8 CLX: 19947.56 (SE +/- 55.22, N = 3)
n2-highcpu-8 CLX: 19477.31 (SE +/- 58.80, N = 3)
t2d-standard-8 AMD: 33592.77 (SE +/- 461.85, N = 3)
c2d-highcpu-8 AMD: 23747.52 (SE +/- 84.28, N = 3)
(CC) gcc options: -lluajit-5.1 -lm -lssl -lcrypto -lpthread -ldl -std=c99 -O2

BRL-CAD

VGR Performance Metric

BRL-CAD 7.34 (VGR Performance Metric, More Is Better)
c3-highcpu-8 SPR: 71072
c2-standard-8 CLX: 50314
n2-standard-8 CLX: 44563
n2-highcpu-8 CLX: 44556
t2d-standard-8 AMD: 99168
c2d-highcpu-8 AMD: 67437
(CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

OpenCV

Test: Core

OpenCV 4.7 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 87372 (SE +/- 280.31, N = 3)
c2-standard-8 CLX: 142770 (SE +/- 2578.21, N = 12)
n2-standard-8 CLX: 209277 (SE +/- 3894.18, N = 12)
n2-highcpu-8 CLX: 208199 (SE +/- 2878.10, N = 9)
t2d-standard-8 AMD: 90443 (SE +/- 3138.88, N = 15)
c2d-highcpu-8 AMD: 80771 (SE +/- 678.61, N = 3)
(CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OpenCV

Test: Graph API

OpenCV 4.7 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 219931 (SE +/- 931.36, N = 3)
c2-standard-8 CLX: 236186 (SE +/- 1570.24, N = 3)
n2-standard-8 CLX: 267250 (SE +/- 727.87, N = 3)
n2-highcpu-8 CLX: 289162 (SE +/- 3363.59, N = 9)
t2d-standard-8 AMD: 210214 (SE +/- 529.01, N = 3)
c2d-highcpu-8 AMD: 199470 (SE +/- 2222.25, N = 3)
(CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OpenCV

Test: Stitching

OpenCV 4.7 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 214760 (SE +/- 1973.06, N = 7)
c2-standard-8 CLX: 250833 (SE +/- 1856.06, N = 3)
n2-standard-8 CLX: 278493 (SE +/- 2268.99, N = 3)
n2-highcpu-8 CLX: 281338 (SE +/- 2338.09, N = 3)
t2d-standard-8 AMD: 195823 (SE +/- 2789.72, N = 3)
c2d-highcpu-8 AMD: 187400 (SE +/- 1971.58, N = 5)
(CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OpenCV

Test: Image Processing

OpenCV 4.7 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 128163 (SE +/- 1624.35, N = 12)
c2-standard-8 CLX: 147234 (SE +/- 1527.37, N = 4)
n2-standard-8 CLX: 178704 (SE +/- 2291.38, N = 3)
n2-highcpu-8 CLX: 172842 (SE +/- 2082.92, N = 3)
t2d-standard-8 AMD: 114410 (SE +/- 1246.60, N = 4)
c2d-highcpu-8 AMD: 117618 (SE +/- 1358.27, N = 3)
(CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt

OpenCV

Test: Object Detection

OpenCV 4.7 (ms, Fewer Is Better)
c3-highcpu-8 SPR: 38999 (SE +/- 384.74, N = 5)
c2-standard-8 CLX: 58056 (SE +/- 751.87, N = 3)
n2-standard-8 CLX: 64284 (SE +/- 842.44, N = 3)
n2-highcpu-8 CLX: 66269 (SE +/- 736.69, N = 4)
t2d-standard-8 AMD: 32014 (SE +/- 424.20, N = 15)
c2d-highcpu-8 AMD: 37666 (SE +/- 509.28, N = 3)
(CXX) g++ options: -fsigned-char -pthread -fomit-frame-pointer -ffunction-sections -fdata-sections -msse -msse2 -msse3 -fvisibility=hidden -O3 -ldl -lm -lpthread -lrt


Phoronix Test Suite v10.8.4