extra tests

benchmarks for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2309060-NE-EXTRATEST87&grs&rdt.

extra testsProcessorMotherboardMemoryDiskGraphicsOSKernelCompilerFile-SystemScreen Resolutiondgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2 x AMD EPYC 9124 16-Core @ 3.00GHz (32 Cores / 64 Threads)Supermicro H13DSH (1.5 BIOS)24 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKET2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07astdrmfbAlmaLinux 9.25.14.0-284.25.1.el9_2.x86_64 (x86_64)GCC 11.3.1 20221121ext41024x7682 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores / 512 Threads)23 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKET2 x AMD EPYC 9334 32-Core @ 2.70GHz (64 Cores / 128 Threads)21 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKETOpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Processor Details- d: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e- g: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00116- h: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00116- 2 x AMD EPYC 9334 32-Core: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e- 9334 2p: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e- 93334 rep: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113eJava Details- OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Details- Python 3.9.16Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

extra testsstress-ng: Semaphoresstress-ng: Pipedeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamstress-ng: x86_64 RdRandstress-ng: Hashstress-ng: Context Switchingstress-ng: Vector Shuffledeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamncnn: CPU - regnety_400mstress-ng: Floating Pointstress-ng: CPU Stressstress-ng: Function Callstress-ng: Matrix Mathstress-ng: Vector Floating Pointstress-ng: Fused Multiply-Addstress-ng: Zlibdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamblender: Classroom - CPU-Onlystress-ng: Vector Mathblender: Pabellon Barcelona - CPU-Onlystress-ng: AVX-512 VNNIstress-ng: Memory Copyingstress-ng: Wide Vector Mathstress-ng: Glibc Qsort Data Sortingstress-ng: Mallocstress-ng: Glibc C String Functionsblender: Barbershop - CPU-Onlyblender: BMW27 - CPU-Onlyospray: gravity_spheres_volume/dim_512/scivis/real_timeembree: Pathtracer - Asian Dragonembree: Pathtracer - Asian Dragon Objospray: gravity_spheres_volume/dim_512/ao/real_timedeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamembree: Pathtracer - Crownembree: Pathtracer ISPC - Asian Dragonembree: Pathtracer ISPC - Crownstress-ng: Pollblender: Fishy Cat - CPU-Onlyncnn: CPU - blazefaceembree: Pathtracer ISPC - Asian Dragon Objstress-ng: SENDFILEospray: particle_volume/ao/real_timeospray: particle_volume/scivis/real_timestress-ng: Cryptoncnn: CPU - shufflenet-v2stress-ng: Forkingstress-ng: AVL Treestress-ng: CPU Cachestress-ng: MMAPncnn: CPU - efficientnet-b0ncnn: CPU-v3-v3 - mobilenet-v3stress-ng: NUMAncnn: CPU - mnasnetncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - FastestDetdeepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Streamoidn: RT.hdr_alb_nrm.3840x2160 - CPU-Onlyoidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyncnn: CPU - squeezenet_ssdospray: gravity_spheres_volume/dim_512/pathtracer/real_timeoidn: RTLightmap.hdr.4096x4096 - CPU-Onlydeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamdragonflydb: 20 - 1:100dragonflydb: 10 - 1:100ncnn: CPU - resnet18ncnn: CPU - alexnetdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Streamdragonflydb: 10 - 1:10deepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Streamncnn: CPU - vgg16ncnn: CPU - resnet50dragonflydb: 20 - 1:10ncnn: CPU - mobilenetncnn: CPU - vision_transformerstress-ng: Socket Activitybrl-cad: VGR Performance Metricstress-ng: MEMFDncnn: CPU - googlenetdeepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Streamdeepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Streamspecfem3d: Layered Halfspacespecfem3d: Mount St. Helensdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamspecfem3d: Water-layered Halfspacedeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Streamstress-ng: Futexdeepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Streamspecfem3d: Homogeneous Halfspacesvt-av1: Preset 13 - Bosphorus 4Kspecfem3d: Tomographic Modeldeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamremhos: Sample Remap Examplebuild-linux-kernel: defconfigdeepsparse: ResNet-50, Baseline - Synchronous Single-Streamdeepsparse: ResNet-50, Baseline - Synchronous Single-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Streamliquid-dsp: 64 - 256 - 512laghos: Sedov Blast Wave, ube_922_hex.meshstress-ng: Matrix 3D Mathcassandra: Writessvt-av1: Preset 13 - Bosphorus 1080pnekrs: Kershawncnn: CPU - yolov4-tinydeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamnekrs: TurboPipe Periodicstress-ng: System V Message Passingdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamliquid-dsp: 2 - 256 - 32ospray: particle_volume/pathtracer/real_timesvt-av1: Preset 4 - Bosphorus 1080pliquid-dsp: 2 - 256 - 57stress-ng: Mixed Schedulerstress-ng: Pthreadliquid-dsp: 4 - 256 - 512liquid-dsp: 1 - 256 - 32liquid-dsp: 8 - 256 - 32liquid-dsp: 32 - 256 - 32liquid-dsp: 8 - 256 - 512liquid-dsp: 16 - 256 - 32liquid-dsp: 16 - 256 - 512liquid-dsp: 4 - 256 - 32liquid-dsp: 1 - 256 - 57liquid-dsp: 1 - 256 - 512liquid-dsp: 64 - 256 - 57liquid-dsp: 2 - 256 - 512liquid-dsp: 32 - 256 - 512stress-ng: Mutexliquid-dsp: 64 - 256 - 32svt-av1: Preset 4 - Bosphorus 1080pstress-ng: Atomicliquid-dsp: 8 - 256 - 57deepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamliquid-dsp: 16 - 256 - 57deepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamsvt-av1: Preset 12 - Bosphorus 4Kliquid-dsp: 4 - 256 - 57liquid-dsp: 32 - 256 - 57svt-av1: Preset 4 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 4 - Bosphorus 4Kdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamsvt-av1: Preset 12 - Bosphorus 4Kdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamsvt-av1: Preset 12 - Bosphorus 1080pdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamsvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080psvt-av1: Preset 8 - Bosphorus 1080psvt-av1: Preset 8 - Bosphorus 1080pvvenc: Bosphorus 4K - Fastvvenc: Bosphorus 1080p - Faststress-ng: Cloningvvenc: Bosphorus 4K - Fastervvenc: Bosphorus 1080p - Fastersvt-av1: Preset 8 - Bosphorus 4Ksvt-av1: Preset 8 - Bosphorus 4Kkripke: laghos: Triple Point Problemdragonflydb: 50 - 1:100dragonflydb: 50 - 1:10build-linux-kernel: allmodconfigdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep92019395.1320249488.61373.0279257.4237258.5599757.47724.7153355.8877114.282113.6578168.724785.485419.876319.8112013602.067218607.1217931974.9225006.7336.384934.5711939.4188466.5427731.16173878.7106715.8532151217.032944.542661.225594.88234239.6122.73667964.1513428.641543291.91916.74137476652.9340991307.95372.7237.939.8131842.764137.987610.0774106.146737.67748.474639.38414337787.7248.955.8440.5726857324.8910.863410.8453107643.4915.461007.46410.89776708.721131.5515.4312.0118.3510.8311.831614.745467.765714.663268.16211.361.3725.6111.90020.6571.3514.013971.836213.919214250093.7111860591.6513.758.144.9317202.501911686474.2640.734524.53628.5726.6815179164.8623.9755.819504.38544648912.6230.111.1189890.944811.086890.075638.48977964914.5056452535.9947150.27634.8222274827.1456139.79344022496.0758.827916.996118.85382887614.87771034620.71448.251220.90435.2837.1102140.45687.1228140.2014504660000263.8510869.492304021152090000040.3419.635650.908974704900006851573.210.081399.1222436.974968834000176.61414.97410563000035640.3670043.7849884000351950002717300001073800000964490005493900001955100001365000005276500012652000177860000024900000387460000438280.162045600000236.99331200000794.0387650410000795.852199.52317428000012345000005.053195.829140.5767139.6127186.648294.619744.914842.8337641.605121.0894419.85361.805362.1006545.195136.0716.76819.1661170.3112.67833.57272.353372072000195.4017583489.3815253621.53800762640.11173499514.942613.0881780.23791778.36165150.4252166.73352405.8755770.5065761.86571128.8846566.7259129.4385129.384978673620.1445298625.91110432440.32154267.34220.9069202.3471782.95523898.45156381.63989553.8604724.84180789739.0816624.0714854.85617.311288008.2622.420133541.1172825.568237581.374846.22719479964.33215429087.6171.977.5149.5123217.591193.050250.4014497.8976183.4023239.4187193.604121346079.310.0226.9193.42973761786.2946.656746.6054462528.7357.6731064.951564.42334039.113664.3447.6735.9614.3829.9231.2444.275.4661182.64175.4478183.37173.403.2858.6628.35211.5030.223533.077931.451931.786529.2615.17.3831135.395520.598148.498956.3453.6241.21108.73110254.19639840497.9457.721.6492605.467213.847372.1848.5843256.16428.7387114.37842951631.9642.038823.7821203.00715.022366.522323.0587.2026138.75847.2111138.59465914000014658.75185134599.57348.0814.415969.32717729966.178.0302124.4072573.358255708000161.16913.5758810900029932.254906.434164100029219000228520000914570000830900004578200001651800001145500004418700010576000207660000020660000331240000364035.5718276000009.831204.44279840000969.2352542300000969.8849189.16415101000010573000004.615191.6654.124167.4236165.5104224.6455113.019153.115648.8841756.6198179.40524.7947490.15471.821471.7619511.795600.23125.22134.8751139.8792.64175.15786069338.66159491222.342633.5921784.35591786.17565175.4863168.75062429.7418774.4054764.73741127.7925569.3208131.5432131.101378690548.4945265797.34110991931.6154293.33222.8586201.5971790.06518600.85158994.25990104.85605340.53181626937.8516625.5713153.495817.121285121.422.3520123210.1472800.748234456.234850.99725705488.15215308424.8171.397.450.0879207.2492172.206850.9618532.4594187.4161225.8305194.274320251576.3310.0223.63184.45373745995.1646.756246.5528461785.3645.2230591.671628.91361173.713749.4748.7637.5614.4629.5530.8942.35.4481183.23655.4268184.09213.533.4658.4128.25371.5430.365132.922630.723132.540629.5617.397.4589134.015919.744650.597558.8554.4744.1493.15113347.776270577115.8355.361.6419608.161112.828677.91699.6974239.19158.6016116.19272880825.1135.850827.8849197.0314.564968.612323.0467.1792139.21457.1163140.440365282000015802.86190955600.76248.8213.96571.56547761412.617.6734130.1929567.85555923000160.8712.9138804200029481.4555328.714105200029200000228090000911870000827490004574200001653900001143900004418300010567000207200000020837000329210000363892.118221000009.901203.49277250000952.8689546690000954.9445194.86315536000010650000004.593177.3634.204166.7561164.6605223.8503113.202752.57748.5107751.7219187.67724.6777485.59371.535571.5815507.017581.067124.695134.381209.5394.83173.25226498215.4439461287.6773.5125519.8211519.99321522.046249.7556718.354233.9079231.4583333.851169.271840.574540.247725697538.815425278.8329340312.9652591.7770.902954.5924103.76176344.3854824.58314789.68198651.8858550931.395876.155651.571151.86430109.2663.376626091.9326146.712804498.361675.93262321226.7872167140.03201.5320.6517.435775.282567.483717.8136204.414969.487686.78572.65149104744.1526.437.272.25531312330.4420.398420.3347156553.5118.631641.77788.211095211.772289.819.7914.8742.3313.7413.9416.448.9324111.8518.8864112.46812.212.2125.5120.34161.0444.304422.567344.739222.347831139658.4925802999.0315.169.393.5591280.781922176931.9826.50637.698136.6529.6231053489.2421.853.7938147.9710125371004.7931.030.87111144.30647.5379132.502221.6690898468.4554735345.6462156.059821.3919205075.3972185.02424717570.3637.142126.917311.61697014127.1369.48714578113.306475.100413.51224.744.809207.71534.8028207.99754210000389.8614005.29255016416.173809563000034.6315.021666.536454411000009268236.48.0098124.7249448.734872597000204.93216.19711132000037887.1470405.47523150003717000028873000011581000001043100005755100002086300001442100005415500013207000224720000025706000413970000457807.79228470000012.06253.6346260000776.9139658580000780.5977163.44218539000012930000005.599162.8554.914137.9127136.595188.743295.739244.487341.3109635.6357158.70520.9994442.5561.457261.4884462.328562.313142.186149.1087.39720.7271245.2113.73236.02589.90577.219391567400198.94212194044.2142013739.43779.2277521.805522.111519.29549.8006719.7723234.1305232.2319334.5256168.338740.550940.446525688067.5215412840.2135971998.9852548.2570.207753.7824101.4176621.2754852.06315324.18198855.7558491170.755885.265626.770552.77430141.5463.456624476.3626144.82802185.771674.65263438850.6477814243.76201.5120.5117.330775.456467.323717.7639204.347769.671586.528872.63629067852.1926.158.1572.77461465208.8120.452620.3981156859.2218.351615793.081135334.92299.2819.9215.1841.4913.6914.0222.298.938111.77868.8725112.64632.202.2028.4720.38911.0344.745722.344744.627922.403929330317.3422585212.5617.4910.73.5922278.179424460221.4826.477237.738436.930.628043162.327.4166.6537674.679943711014.2433.40.86771149.12557.7484128.907721.5373245738.3200918565.6693156.277320.8594721725.3028188.31794644551.2937.297326.805511.643695909155.1129.41908663513.296775.1513.96224.6714.7159211.81054.7942208.3426755970000388.86519828914022.5258638560.785814712000040.1915.086866.247354260300009258853.698.0136124.6687451.054472480000209.60916.29511133000037702.2270476.97524690003712600028843000011587000001051700005795100002078100001448000005562400013357000223160000025885000414060000456942.16229130000012.358254.16336320000779.176674330000781.5381160.62618536000012750000005.643163.4255.012137.4827136.4554189.515895.536444.410641.0198635.6516159.25821.0312442.43661.218461.2502458.518520.167140.171150.3467.44720.882119013.68435.57596.10576.503393276500199.20214488001.2142047609.47777.8507520.3204519.7251521.745149.8729715.5913233.339231.4234334.2742168.63140.379640.286825663966.0215326552.1435280554.4252511.0770.824468.6824129.62176710.8554675.85314753.85198610.9358570424.415875.455618.588951.99430096.5563.716621974.9826129.52802578.871676.27263330738.6372810418.71201.9320.4317.278975.11766.850717.8324204.221169.48586.55572.17499078474.0126.378.3972.84651536517.6520.388220.3139156863.7818.471616.05797.641096840.322287.6919.9615.8742.114.2415.0221.58.9437111.70148.8896112.43152.192.2134.9920.33161.0444.358522.539844.661122.386929280070.9724588214.8116.2210.593.5347282.716523645213.1326.479537.735242.0841.0229580028.0627.9561.2438727.869998931009.4733.680.87041145.23438.1347122.790521.6888129918.486910525.6773156.192821.1513706085.2862188.92434752131.3737.574226.607611.533002652152.5839.37456308213.380174.68413.82524.7844.7957208.31034.8205207.2316753120000387.15078606914009.23268984517.438813095000043.5915.082966.267754372400009251814.097.9992124.8961448.886372532000208.8616.6788624800037534518210003718300029032000011595000001022700005809200002093700001438300005588200013320000219430000026081000415200000457061.39229170000012.25254.23329180000779.8287656510000781.9977161.05318667000013024000005.65160.0785.03137.9731136.8061189.326195.597144.680941.101635.1989161.66620.9984416.14961.49161.4475443.322593.633137.614148.8467.35821.0051188.6413.80334.7694.72677.191375715500197.19OpenBenchmarking.org

Stress-NG

Test: Semaphores

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Semaphoresdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep200M400M600M800M1000M92019395.13800762640.11786069338.66226498215.44212194044.21214488001.211. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pipe

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Pipedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep40M80M120M160M200M20249488.61173499514.94159491222.3439461287.6042013739.4342047609.471. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep6001200180024003000373.032613.092633.59773.51779.23777.85

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep400800120016002000257.421780.241784.36519.82521.81520.32

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep400800120016002000258.561778.361786.18519.99522.11519.73

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep11002200330044005500757.485150.435175.491522.051519.301521.75

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020024.72166.73168.7549.7649.8049.87

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep5001000150020002500355.892405.882429.74718.35719.77715.59

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep170340510680850114.28770.51774.41233.91234.13233.34

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep160320480640800113.66761.87764.74231.46232.23231.42

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2004006008001000168.721128.881127.79333.85334.53334.27

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep12024036048060085.49566.73569.32169.27168.34168.63

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30609012015019.88129.44131.5440.5740.5540.38

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30609012015019.81129.38131.1040.2540.4540.29

Stress-NG

Test: x86_64 RdRand

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: x86_64 RdRanddgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep20M40M60M80M100M12013602.0678673620.1478690548.4925697538.8025688067.5225663966.021. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Hash

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Hashdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep10M20M30M40M50M7218607.1245298625.9145265797.3415425278.8315412840.2115326552.141. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Context Switching

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Context Switchingdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep20M40M60M80M100M17931974.92110432440.32110991931.6029340312.9635971998.9835280554.421. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Shuffle

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Shuffledgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30K60K90K120K150K25006.73154267.34154293.3352591.7752548.2552511.071. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep5010015020025036.38220.91222.8670.9070.2170.82

NCNN

Target: CPU - Model: regnety_400m

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020033.50202.34201.5954.5953.7868.68MIN: 33.26 / MAX: 37.92MIN: 187.7 / MAX: 1161.84MIN: 191.08 / MAX: 1525.04MIN: 50.39 / MAX: 663.91MIN: 53.02 / MAX: 60.41MIN: 53.81 / MAX: 3458.691. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Floating Pointdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep15K30K45K60K75K11939.4171782.9571790.0624103.7624101.4024129.621. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU Stressdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep110K220K330K440K550K88466.54523898.45518600.85176344.38176621.27176710.851. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Function Call

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Function Calldgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30K60K90K120K150K27731.16156381.63158994.2554824.5854852.0654675.851. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix Mathdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep200K400K600K800K1000K173878.70989553.80990104.85314789.68315324.18314753.851. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Floating Pointdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep130K260K390K520K650K106715.85604724.84605340.53198651.88198855.75198610.931. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Fused Multiply-Adddgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep40M80M120M160M200M32151217.03180789739.08181626937.8558550931.3958491170.7558570424.411. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Zlib

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Zlibdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4K8K12K16K20K2944.5416624.0716625.575876.155885.265875.451. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep3K6K9K12K15K2661.2314854.8613153.505651.575626.775618.59

Blender

Blend File: Classroom - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2040608010094.8817.3117.1251.8652.7751.99

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Mathdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep300K600K900K1200K1500K234239.601288008.261285121.40430109.26430141.54430096.551. (CXX) g++ options: -O2 -std=gnu99 -lc

Blender

Blend File: Pabellon Barcelona - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep306090120150122.7022.4022.3563.3763.4563.71

Stress-NG

Test: AVX-512 VNNI

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVX-512 VNNIdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4M8M12M16M20M3667964.1520133541.1120123210.146626091.936624476.366621974.981. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Memory Copyingdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep16K32K48K64K80K13428.6472825.5672800.7426146.7126144.8026129.501. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Wide Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Wide Vector Mathdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2M4M6M8M10M1543291.918237581.378234456.232804498.362802185.772802578.871. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Glibc Qsort Data Sorting

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Glibc Qsort Data Sortingdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep10002000300040005000916.744846.224850.991675.931674.651676.271. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Malloc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mallocdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep160M320M480M640M800M137476652.93719479964.33725705488.15262321226.78263438850.64263330738.631. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Glibc C String Functions

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Glibc C String Functionsdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep50M100M150M200M250M40991307.95215429087.61215308424.8172167140.0377814243.7672810418.711. (CXX) g++ options: -O2 -std=gnu99 -lc

Blender

Blend File: Barbershop - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep80160240320400372.7271.9771.39201.53201.51201.93

Blender

Blend File: BMW27 - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep91827364537.937.517.4020.6520.5120.43

OSPRay

Benchmark: gravity_spheres_volume/dim_512/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep11223344559.8131849.5123050.0879017.4357017.3307017.27890

Embree

Binary: Pathtracer - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragondgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep5010015020025042.76217.59207.2575.2875.4675.12MIN: 42.48 / MAX: 43.18MIN: 211.81 / MAX: 228.26MIN: 203.29 / MAX: 214.14MIN: 73.5 / MAX: 78.24MIN: 73.59 / MAX: 78.22MIN: 73.42 / MAX: 78.19

Embree

Binary: Pathtracer - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragon Objdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020037.99193.05172.2167.4867.3266.85MIN: 37.71 / MAX: 38.4MIN: 174.03 / MAX: 205.21MIN: 150.53 / MAX: 183.32MIN: 65.8 / MAX: 70.02MIN: 65.8 / MAX: 70.31MIN: 65.41 / MAX: 70.72

OSPRay

Benchmark: gravity_spheres_volume/dim_512/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445510.0850.4050.9617.8117.7617.83

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep120240360480600106.15497.90532.46204.41204.35204.22

Embree

Binary: Pathtracer - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Crowndgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020037.68183.40187.4269.4969.6769.49MIN: 37.26 / MAX: 39.44MIN: 179.05 / MAX: 191.46MIN: 183.21 / MAX: 193.83MIN: 65.97 / MAX: 75.27MIN: 66.25 / MAX: 75.27MIN: 65.87 / MAX: 75.16

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragondgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep5010015020025048.47239.42225.8386.7986.5386.56MIN: 48.1 / MAX: 49.28MIN: 232.82 / MAX: 253.03MIN: 221.04 / MAX: 232.51MIN: 83.33 / MAX: 93.39MIN: 83.4 / MAX: 92.47MIN: 83.15 / MAX: 93.4

Embree

Binary: Pathtracer ISPC - Model: Crown

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Crowndgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020039.38193.60194.2772.6572.6472.17MIN: 38.86 / MAX: 40.97MIN: 188.25 / MAX: 201.2MIN: 188.82 / MAX: 203.12MIN: 68.96 / MAX: 77.69MIN: 69.23 / MAX: 79.15MIN: 68.85 / MAX: 77.58

Stress-NG

Test: Poll

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Polldgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep5M10M15M20M25M4337787.7221346079.3020251576.339104744.159067852.199078474.011. (CXX) g++ options: -O2 -std=gnu99 -lc

Blender

Blend File: Fishy Cat - Compute: CPU-Only

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445548.9510.0210.0226.4326.1526.37

NCNN

Target: CPU - Model: blazeface

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefacedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep6121824305.6326.9023.637.208.158.39MIN: 5.53 / MAX: 5.95MIN: 21.88 / MAX: 899.43MIN: 21.98 / MAX: 379.78MIN: 7.05 / MAX: 7.44MIN: 7.89 / MAX: 14.8MIN: 8.09 / MAX: 17.411. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Embree

Binary: Pathtracer ISPC - Model: Asian Dragon Obj

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragon Objdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020040.57193.43184.4572.2672.7772.85MIN: 40.28 / MAX: 41.19MIN: 170.24 / MAX: 203.63MIN: 159.04 / MAX: 199.2MIN: 69.72 / MAX: 77.54MIN: 70.2 / MAX: 78.62MIN: 70.3 / MAX: 76.96

Stress-NG

Test: SENDFILE

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: SENDFILEdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep800K1600K2400K3200K4000K857324.893761786.293745995.161312330.441465208.811536517.651. (CXX) g++ options: -O2 -std=gnu99 -lc

OSPRay

Benchmark: particle_volume/ao/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445510.8646.6646.7620.4020.4520.39

OSPRay

Benchmark: particle_volume/scivis/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445510.8546.6146.5520.3320.4020.31

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Cryptodgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep100K200K300K400K500K107643.49462528.73461785.36156553.51156859.22156863.781. (CXX) g++ options: -O2 -std=gnu99 -lc

NCNN

Target: CPU - Model: shufflenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep132639526513.9657.6745.2218.6318.3518.47MIN: 13.65 / MAX: 18.71MIN: 40.74 / MAX: 1797.58MIN: 40.32 / MAX: 296.92MIN: 18.34 / MAX: 19.34MIN: 17.76 / MAX: 24.49MIN: 18.09 / MAX: 20.531. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: Forking

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Forkingdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep7K14K21K28K35K1007.4631064.9530591.671641.771615.001616.051. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: AVL Tree

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVL Treedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30060090012001500410.891564.421628.91788.21793.08797.641. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU Cachedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep200K400K600K800K1000K776708.72334039.11361173.711095211.771135334.901096840.321. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: MMAP

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MMAPdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep80016002400320040001131.553664.343749.472289.802299.282287.691. (CXX) g++ options: -O2 -std=gnu99 -lc

NCNN

Target: CPU - Model: efficientnet-b0

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445514.7647.6748.7619.7919.9219.96MIN: 14.47 / MAX: 29.12MIN: 47.05 / MAX: 112.46MIN: 47.17 / MAX: 210.73MIN: 18.88 / MAX: 151.97MIN: 19.72 / MAX: 21.08MIN: 19.55 / MAX: 33.091. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v3-v3 - Model: mobilenet-v3

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep91827364511.6435.9637.5614.8715.1815.87MIN: 11.48 / MAX: 15.48MIN: 34.15 / MAX: 266.64MIN: 33.74 / MAX: 351.29MIN: 14.55 / MAX: 15.42MIN: 14.82 / MAX: 21.88MIN: 15.57 / MAX: 16.371. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: NUMA

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: NUMAdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep102030405018.3514.3814.4642.3341.4942.101. (CXX) g++ options: -O2 -std=gnu99 -lc

NCNN

Target: CPU - Model: mnasnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep71421283510.7029.9229.5513.7413.6914.24MIN: 10.22 / MAX: 14.66MIN: 27.82 / MAX: 234.43MIN: 28.64 / MAX: 97.1MIN: 13.14 / MAX: 22.9MIN: 13.25 / MAX: 14.74MIN: 13.93 / MAX: 14.541. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU-v2-v2 - Model: mobilenet-v2

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep71421283511.2931.2430.8913.9414.0215.02MIN: 10.83 / MAX: 11.91MIN: 30.27 / MAX: 71.7MIN: 29.59 / MAX: 58.99MIN: 13.14 / MAX: 22.71MIN: 13.46 / MAX: 22.99MIN: 14.05 / MAX: 103.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: FastestDet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep102030405016.0044.2742.3016.4422.2921.50MIN: 15.86 / MAX: 20.49MIN: 40.55 / MAX: 822.5MIN: 41.82 / MAX: 83.33MIN: 16.09 / MAX: 32.24MIN: 21.86 / MAX: 23.6MIN: 20.83 / MAX: 30.281. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162014.74545.46615.44818.93248.93808.9437

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020067.77182.64183.24111.85111.78111.70

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162014.66325.44785.42688.88648.87258.8896

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep408012016020068.16183.37184.09112.47112.65112.43

Intel Open Image Denoise

Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep0.79431.58862.38293.17723.97151.363.403.532.212.202.19

Intel Open Image Denoise

Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep0.77851.5572.33553.1143.89251.373.283.462.212.202.21

NCNN

Target: CPU - Model: squeezenet_ssd

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssddgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep132639526524.2258.6658.4125.5128.4734.99MIN: 23.8 / MAX: 28.97MIN: 55.23 / MAX: 333.73MIN: 57.55 / MAX: 98.8MIN: 23.43 / MAX: 95.27MIN: 28.05 / MAX: 34.83MIN: 28.93 / MAX: 1349.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OSPRay

Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep71421283511.9028.3528.2520.3420.3920.33

Intel Open Image Denoise

Run: RTLightmap.hdr.4096x4096 - Device: CPU-Only

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RTLightmap.hdr.4096x4096 - Device: CPU-Onlydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep0.34650.6931.03951.3861.73250.651.501.541.041.031.04

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep163248648071.3530.2230.3744.3044.7544.36

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep81624324014.0133.0832.9222.5722.3422.54

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep163248648071.8431.4530.7244.7444.6344.66

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep81624324013.9231.7932.5422.3522.4022.39

Dragonflydb

Clients Per Thread: 20 - Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:100d2 x AMD EPYC 9334 32-Core9334 2p93334 rep7M14M21M28M35M14250093.7131139658.4929330317.3429280070.971. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 10 - Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:100d2 x AMD EPYC 9334 32-Core9334 2p93334 rep6M12M18M24M30M11860591.6525802999.0322585212.5624588214.811. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

NCNN

Target: CPU - Model: resnet18

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep71421283513.6529.2629.5615.1617.4916.22MIN: 13.29 / MAX: 15.11MIN: 26.62 / MAX: 124.57MIN: 27.99 / MAX: 114.19MIN: 14.45 / MAX: 22.73MIN: 16.57 / MAX: 24.11MIN: 15.17 / MAX: 57.271. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: alexnet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep481216208.1315.1017.399.3910.7010.59MIN: 7.95 / MAX: 8.44MIN: 14.22 / MAX: 74.43MIN: 14.86 / MAX: 374.11MIN: 9 / MAX: 9.79MIN: 10.12 / MAX: 12.2MIN: 9.51 / MAX: 107.871. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2468104.93177.38317.45893.55913.59223.5347

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep60120180240300202.50135.40134.02280.78278.18282.72

Dragonflydb

Clients Per Thread: 10 - Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:10d2 x AMD EPYC 9334 32-Core9334 2p93334 rep5M10M15M20M25M11686474.2622176931.9824460221.4823645213.131. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep91827364540.7320.6019.7426.5126.4826.48

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445524.5448.5050.6037.7037.7437.74

NCNN

Target: CPU - Model: vgg16

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep132639526528.6956.3458.8536.6536.9042.08MIN: 28.07 / MAX: 37.47MIN: 50.98 / MAX: 172.01MIN: 55.68 / MAX: 98.78MIN: 33.43 / MAX: 47.15MIN: 34.81 / MAX: 49.99MIN: 36.25 / MAX: 590.931. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: resnet50

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep122436486026.5353.6254.4729.6230.6041.02MIN: 26.13 / MAX: 30.95MIN: 51.72 / MAX: 288.16MIN: 51.64 / MAX: 145.97MIN: 27.91 / MAX: 147.23MIN: 30.17 / MAX: 37.09MIN: 30.92 / MAX: 2215.161. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Dragonflydb

Clients Per Thread: 20 - Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:10d2 x AMD EPYC 9334 32-Core9334 2p93334 rep7M14M21M28M35M15179164.8631053489.2428043162.3029580028.061. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

NCNN

Target: CPU - Model: mobilenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep102030405023.5541.2144.1421.8027.4127.95MIN: 23.18 / MAX: 27.8MIN: 40.54 / MAX: 69.23MIN: 42.79 / MAX: 118.07MIN: 21.41 / MAX: 29.57MIN: 26.74 / MAX: 33.85MIN: 27.34 / MAX: 34.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

NCNN

Target: CPU - Model: vision_transformer

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2040608010055.05108.7393.1553.7966.6561.24MIN: 54.24 / MAX: 72.37MIN: 94.5 / MAX: 1869.82MIN: 88.26 / MAX: 206.51MIN: 52.29 / MAX: 122.53MIN: 59.07 / MAX: 626.24MIN: 59.18 / MAX: 264.241. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Stress-NG

Test: Socket Activity

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Socket Activitydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep20K40K60K80K100K9504.38110254.19113347.7738147.9737674.6738727.861. (CXX) g++ options: -O2 -std=gnu99 -lc

BRL-CAD

VGR Performance Metric

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metricdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep1.4M2.8M4.2M5.6M7M5446486398404627057710125379943719998931. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

Stress-NG

Test: MEMFD

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MEMFDdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2004006008001000912.6297.94115.831004.791014.241009.471. (CXX) g++ options: -O2 -std=gnu99 -lc

NCNN

Target: CPU - Model: googlenet

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep132639526529.0357.7255.3631.0333.4033.68MIN: 27.97 / MAX: 36.49MIN: 52.79 / MAX: 429.12MIN: 54.42 / MAX: 115.96MIN: 30.4 / MAX: 40.18MIN: 32.92 / MAX: 35.55MIN: 32.93 / MAX: 40.381. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep0.37110.74221.11331.48441.85551.11891.64921.64190.87110.86770.8704

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2004006008001000890.94605.47608.161144.311149.131145.23

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162011.086813.847312.82867.53797.74848.1347

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30609012015090.0872.1877.92132.50128.91122.79

SPECFEM3D

Model: Layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered Halfspaced2 x AMD EPYC 9334 32-Core9334 2p93334 rep91827364538.4921.6721.5421.691. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

SPECFEM3D

Model: Mount St. Helens

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. Helensd2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162014.5056452538.4554735348.3200918568.4869105201. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep36912155.99478.58439.69745.64625.66935.6773

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep60120180240300150.28256.16239.19156.06156.28156.19

SPECFEM3D

Model: Water-layered Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered Halfspaced2 x AMD EPYC 9334 32-Core9334 2p93334 rep81624324034.8221.3920.8621.151. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2468107.14568.73878.60165.39725.30285.2862

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200139.79114.38116.19185.02188.32188.92

Stress-NG

Test: Futex

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Futexdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep1000K2000K3000K4000K5000K4022496.072951631.962880825.114717570.364644551.294752131.371. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep132639526558.8342.0435.8537.1437.3037.57

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep71421283517.0023.7827.8826.9226.8126.61

SPECFEM3D

Model: Homogeneous Halfspace

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous Halfspaced2 x AMD EPYC 9334 32-Core9334 2p93334 rep51015202518.8511.6211.6411.531. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 13 - Input: Bosphorus 4Kgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200203.01197.03127.14155.11152.581. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SPECFEM3D

Model: Tomographic Model

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic Modeld2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162014.8777103469.4871457819.4190866359.3745630821. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep51015202520.7115.0214.5613.3113.3013.38

Neural Magic DeepSparse

Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2040608010048.2566.5268.6175.1075.1574.68

Remhos

Test: Sample Remap Example

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Exampled2 x AMD EPYC 9334 32-Core9334 2p93334 rep51015202520.9013.5113.9613.831. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep81624324035.2823.0623.0524.7424.6724.78

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2468107.11027.20267.17924.80904.71594.7957

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep50100150200250140.46138.76139.21207.72211.81208.31

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2468107.12287.21117.11634.80284.79424.8205

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep50100150200250140.20138.59140.44207.99208.34207.23

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep160M320M480M640M800M5046600006591400006528200007542100007559700007531200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Laghos

Test: Sedov Blast Wave, ube_922_hex.mesh

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshd2 x AMD EPYC 9334 32-Core9334 2p93334 rep80160240320400263.85389.86388.87387.151. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix 3D Mathdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep3K6K9K12K15K10869.4914658.7515802.8614005.2914022.5014009.231. (CXX) g++ options: -O2 -std=gnu99 -lc

Apache Cassandra

Test: Writes

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writesdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep60K120K180K240K300K230402185134190955255016258638268984

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 13 - Input: Bosphorus 1080pgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep130260390520650599.57600.76416.17560.79517.441. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

nekRS

Input: Kershaw

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawd2 x AMD EPYC 9334 32-Core9334 2p93334 rep2000M4000M6000M8000M10000M115209000008095630000814712000081309500001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

NCNN

Target: CPU - Model: yolov4-tiny

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinydgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445535.7548.0848.8234.6340.1943.59MIN: 34.32 / MAX: 45.5MIN: 42.94 / MAX: 393.57MIN: 45.36 / MAX: 93.88MIN: 32.71 / MAX: 43.98MIN: 39.2 / MAX: 49.85MIN: 35.28 / MAX: 640.681. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep51015202519.6414.4213.9715.0215.0915.08

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep163248648050.9169.3371.5766.5466.2566.27

nekRS

Input: TurboPipe Periodic

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodicd2 x AMD EPYC 9334 32-Core9334 2p93334 rep1600M3200M4800M6400M8000M74704900005441100000542603000054372400001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Stress-NG

Test: System V Message Passing

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: System V Message Passingdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2M4M6M8M10M6851573.207729966.177761412.619268236.409258853.699251814.091. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep369121510.08138.03027.67348.00988.01367.9992

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30609012015099.12124.41130.19124.72124.67124.90

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep120240360480600436.97573.36567.86448.73451.05448.89

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 2 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep16M32M48M64M80M6883400055708000559230007259700072480000725320001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OSPRay

Benchmark: particle_volume/pathtracer/real_time

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/pathtracer/real_timedgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep50100150200250176.61161.17160.87204.93209.61208.86

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 4 - Input: Bosphorus 1080pdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162014.9713.5812.9116.2016.3016.681. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 2 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep20M40M60M80M100M1056300008810900088042000111320000111330000862480001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Mixed Scheduler

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mixed Schedulerdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep8K16K24K32K40K35640.3629932.2029481.4537887.1437702.2237534.001. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Pthreaddgh2 x AMD EPYC 9334 32-Core9334 2p15K30K45K60K75K70043.7854906.4355328.7170405.4770476.971. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 4 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep11M22M33M44M55M4988400041641000410520005231500052469000518210001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep8M16M24M32M40M3519500029219000292000003717000037126000371830001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 8 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep60M120M180M240M300M2717300002285200002280900002887300002884300002903200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep200M400M600M800M1000M10738000009145700009118700001158100000115870000011595000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 8 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep20M40M60M80M100M9644900083090000827490001043100001051700001022700001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep120M240M360M480M600M5493900004578200004574200005755100005795100005809200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep40M80M120M160M200M1955100001651800001653900002086300002078100002093700001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 4 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30M60M90M120M150M1365000001145500001143900001442100001448000001438300001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep12M24M36M48M60M5276500044187000441830005415500055624000558820001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 1 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep3M6M9M12M15M1265200010576000105670001320700013357000133200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep500M1000M1500M2000M2500M1778600000207660000020720000002247200000223160000021943000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 2 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 2 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep6M12M18M24M30M2490000020660000208370002570600025885000260810001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 512

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep90M180M270M360M450M3874600003312400003292100004139700004140600004152000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Stress-NG

Test: Mutex

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mutexdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep100K200K300K400K500K438280.16364035.57363892.10457807.79456942.16457061.391. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 64 - Buffer Length: 256 - Filter Length: 32

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep500M1000M1500M2000M2500M2045600000182760000018221000002284700000229130000022917000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 4 - Input: Bosphorus 1080pgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep36912159.8319.90112.06012.35812.2501. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Stress-NG

Test: Atomic

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Atomicdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep60120180240300236.99204.44203.49253.60254.16254.231. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

Threads: 8 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 8 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep70M140M210M280M350M3312000002798400002772500003462600003363200003291800001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2004006008001000794.04969.24952.87776.91779.18779.83

Liquid-DSP

Threads: 16 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep140M280M420M560M700M6504100005423000005466900006585800006743300006565100001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2004006008001000795.85969.88954.94780.60781.54782.00

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 12 - Input: Bosphorus 4Kdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200199.52189.16194.86163.44160.63161.051. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Liquid-DSP

Threads: 4 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 4 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep40M80M120M160M200M1742800001510100001553600001853900001853600001866700001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Liquid-DSP

Threads: 32 - Buffer Length: 256 - Filter Length: 57

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57dgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep300M600M900M1200M1500M1234500000105730000010650000001293000000127500000013024000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 4 - Input: Bosphorus 4Kdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep1.27132.54263.81395.08526.35655.0534.6154.5935.5995.6435.6501. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 13 - Input: Bosphorus 4Kdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200195.83191.67177.36162.86163.43160.081. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 4 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 4 - Input: Bosphorus 4Kgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep1.13182.26363.39544.52725.6594.1244.2044.9145.0125.0301. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200140.58167.42166.76137.91137.48137.97

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200139.61165.51164.66136.60136.46136.81

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep50100150200250186.65224.65223.85188.74189.52189.33

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep30609012015094.62113.02113.2095.7495.5495.60

Neural Magic DeepSparse

Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep122436486044.9153.1252.5844.4944.4144.68

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep112233445542.8348.8848.5141.3141.0241.10

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep160320480640800641.61756.62751.72635.64635.65635.20

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 12 - Input: Bosphorus 4Kgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200179.41187.68158.71159.26161.671. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep61218243021.0924.7924.6821.0021.0321.00

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 12 - Input: Bosphorus 1080pdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep110220330440550419.85490.15485.59442.55442.44416.151. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep163248648061.8171.8271.5461.4661.2261.49

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Streamdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep163248648062.1071.7671.5861.4961.2561.45

SVT-AV1

Encoder Mode: Preset 12 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 12 - Input: Bosphorus 1080pgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep110220330440550511.80507.02462.33458.52443.321. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 13 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 13 - Input: Bosphorus 1080pdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep130260390520650545.20600.23581.07562.31520.17593.631. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 8 - Input: Bosphorus 1080pgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep306090120150125.22124.70142.19140.17137.611. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 1080p

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 8 - Input: Bosphorus 1080pdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep306090120150136.07134.88134.38149.11150.35148.851. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

VVenC

Video Input: Bosphorus 4K - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fastd2 x AMD EPYC 9334 32-Core9334 2p93334 rep2468106.7687.3977.4477.3581. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

VVenC

Video Input: Bosphorus 1080p - Video Preset: Fast

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fastd2 x AMD EPYC 9334 32-Core9334 2p93334 rep51015202519.1720.7320.8821.011. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Stress-NG

Test: Cloning

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Cloningdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep300600900120015001170.311139.871209.531245.211190.001188.641. (CXX) g++ options: -O2 -std=gnu99 -lc

VVenC

Video Input: Bosphorus 4K - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fasterd2 x AMD EPYC 9334 32-Core9334 2p93334 rep4812162012.6813.7313.6813.801. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

VVenC

Video Input: Bosphorus 1080p - Video Preset: Faster

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fasterd2 x AMD EPYC 9334 32-Core9334 2p93334 rep81624324033.5736.0335.5834.761. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 8 - Input: Bosphorus 4Kgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2040608010092.6494.8389.9196.1194.731. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

SVT-AV1

Encoder Mode: Preset 8 - Input: Bosphorus 4K

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 8 - Input: Bosphorus 4Kdgh2 x AMD EPYC 9334 32-Core9334 2p93334 rep2040608010072.3575.1573.2577.2276.5077.191. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Kripke

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.6d2 x AMD EPYC 9334 32-Core9334 2p93334 rep80M160M240M320M400M3720720003915674003932765003757155001. (CXX) g++ options: -O3 -fopenmp -ldl

Laghos

Test: Triple Point Problem

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemd2 x AMD EPYC 9334 32-Core9334 2p93334 rep4080120160200195.40198.94199.20197.191. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Dragonflydb

Clients Per Thread: 50 - Set To Get Ratio: 1:100

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:100d4M8M12M16M20M17583489.381. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Dragonflydb

Clients Per Thread: 50 - Set To Get Ratio: 1:10

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:10d3M6M9M12M15M15253621.531. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre


Phoronix Test Suite v10.8.5