extra tests

2 x AMD EPYC 9754 128-Core testing with a Supermicro H13DSH (1.5 BIOS) and astdrmfb on AlmaLinux 9.2 via the Phoronix Test Suite.

Compare your own system(s) to this result file with the Phoronix Test Suite by running the command: phoronix-test-suite benchmark 2308305-NE-EXTRATEST07
Jump To Table - Results

View

Do Not Show Noisy Results
Do Not Show Results With Incomplete Data
Do Not Show Results With Little Change/Spread
List Notable Results
Show Result Confidence Charts

Limit displaying results to tests within:

CPU Massive 5 Tests
Creator Workloads 7 Tests
Database Test Suite 2 Tests
Encoding 2 Tests
Game Development 2 Tests
HPC - High Performance Computing 4 Tests
Machine Learning 2 Tests
Multi-Core 8 Tests
NVIDIA GPU Compute 2 Tests
Intel oneAPI 3 Tests
OpenMPI Tests 5 Tests
Renderers 2 Tests
Server 2 Tests
Server CPU Tests 4 Tests
Video Encoding 2 Tests
Common Workstation Benchmarks 2 Tests

Statistics

Show Overall Harmonic Mean(s)
Show Overall Geometric Mean
Show Geometric Means Per-Suite/Category
Show Wins / Losses Counts (Pie Chart)
Normalize Results
Remove Outliers Before Calculating Averages

Graph Settings

Force Line Graphs Where Applicable
Convert To Scalar Where Applicable
Prefer Vertical Bar Graphs

Additional Graphs

Show Perf Per Core/Thread Calculation Graphs Where Applicable
Show Perf Per Clock Calculation Graphs Where Applicable

Multi-Way Comparison

Condense Multi-Option Tests Into Single Result Graphs
Condense Test Profiles With Multiple Version Results Into Single Result Graphs

Table

Show Detailed System Result Table

Run Management

Highlight
Result
Hide
Result
Result
Identifier
View Logs
Performance Per
Dollar
Date
Run
  Test
  Duration
d
August 25 2023
  3 Hours, 8 Minutes
e
August 30 2023
  4 Hours, 23 Minutes
f
August 30 2023
  4 Hours, 22 Minutes
g
August 30 2023
  4 Hours, 25 Minutes
h
August 30 2023
  4 Hours, 23 Minutes
Invert Hiding All Results Option
  4 Hours, 8 Minutes

Only show results where is faster than
Only show results matching title/arguments (delimit multiple options with a comma):
Do not show results matching title/arguments (delimit multiple options with a comma):


extra testsProcessorMotherboardMemoryDiskGraphicsOSKernelCompilerFile-SystemScreen Resolutiondefgh2 x AMD EPYC 9124 16-Core @ 3.00GHz (32 Cores / 64 Threads)Supermicro H13DSH (1.5 BIOS)24 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKET2 x 1920GB SAMSUNG MZQL21T9HCJR-00A07astdrmfbAlmaLinux 9.25.14.0-284.25.1.el9_2.x86_64 (x86_64)GCC 11.3.1 20221121ext41024x7682 x AMD EPYC 9754 128-Core @ 2.25GHz (256 Cores / 512 Threads)23 x 32 GB DDR5-4800MT/s Samsung M321R4GA3BB6-CQKETOpenBenchmarking.orgKernel Details- Transparent Huge Pages: alwaysCompiler Details- --build=x86_64-redhat-linux --disable-libunwind-exceptions --enable-__cxa_atexit --enable-bootstrap --enable-cet --enable-checking=release --enable-gnu-indirect-function --enable-gnu-unique-object --enable-host-bind-now --enable-host-pie --enable-initfini-array --enable-languages=c,c++,fortran,lto --enable-link-serialization=1 --enable-multilib --enable-offload-targets=nvptx-none --enable-plugin --enable-shared --enable-threads=posix --mandir=/usr/share/man --with-arch_32=x86-64 --with-arch_64=x86-64-v2 --with-build-config=bootstrap-lto --with-gcc-major-version-only --with-linker-hash-style=gnu --with-tune=generic --without-cuda-driver --without-isl Processor Details- d: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xa10113e- e: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00116- f: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00116- g: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00116- h: Scaling Governor: acpi-cpufreq performance (Boost: Enabled) - CPU Microcode: 0xaa00116Java Details- OpenJDK Runtime Environment (Red_Hat-11.0.20.0.8-1) (build 11.0.20+8-LTS)Python Details- Python 3.9.16Security Details- itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of usercopy/swapgs barriers and __user pointer sanitization + spectre_v2: Mitigation of Retpolines IBPB: conditional IBRS_FW STIBP: always-on RSB filling PBRSB-eIBRS: Not affected + srbds: Not affected + tsx_async_abort: Not affected

defghLogarithmic Result OverviewPhoronix Test SuiteBlenderEmbreeStress-NGOSPRayIntel Open Image DenoiseNCNNBRL-CADNeural Magic DeepSparseTimed Linux Kernel CompilationApache CassandraLiquid-DSPSVT-AV1

extra testsbrl-cad: VGR Performance Metricncnn: CPU - FastestDetncnn: CPU - vision_transformerncnn: CPU - regnety_400mncnn: CPU - squeezenet_ssdncnn: CPU - yolov4-tinyncnn: CPU - resnet50ncnn: CPU - alexnetncnn: CPU - resnet18ncnn: CPU - vgg16ncnn: CPU - googlenetncnn: CPU - blazefacencnn: CPU - efficientnet-b0ncnn: CPU - mnasnetncnn: CPU - shufflenet-v2ncnn: CPU-v3-v3 - mobilenet-v3ncnn: CPU-v2-v2 - mobilenet-v2ncnn: CPU - mobilenetospray: particle_volume/scivis/real_timedeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Streamospray: particle_volume/pathtracer/real_timedeepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Streamdeepsparse: BERT-Large, NLP Question Answering - Synchronous Single-Streamblender: Barbershop - CPU-Onlycassandra: Writesdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamospray: particle_volume/ao/real_timedeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Streamospray: gravity_spheres_volume/dim_512/scivis/real_timeospray: gravity_spheres_volume/dim_512/ao/real_timestress-ng: CPU Cachedeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Streamospray: gravity_spheres_volume/dim_512/pathtracer/real_timedeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Streamblender: Pabellon Barcelona - CPU-Onlydeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Synchronous Single-Streamsvt-av1: Preset 4 - Bosphorus 4Kdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO - Synchronous Single-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Synchronous Single-Streamdeepsparse: ResNet-50, Baseline - Synchronous Single-Streamdeepsparse: ResNet-50, Baseline - Synchronous Single-Streamdeepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Streamdeepsparse: ResNet-50, Sparse INT8 - Synchronous Single-Streamlaghos: Sedov Blast Wave, ube_922_hex.meshsvt-av1: Preset 4 - Bosphorus 4Kstress-ng: NUMAstress-ng: Atomicnekrs: TurboPipe Periodicblender: Classroom - CPU-Onlystress-ng: Pthreadstress-ng: Cloningstress-ng: Futexstress-ng: Mallocstress-ng: MEMFDstress-ng: Mixed Schedulerstress-ng: MMAPstress-ng: Zlibstress-ng: Fused Multiply-Addstress-ng: Vector Shufflestress-ng: Pollstress-ng: Hashstress-ng: Forkingstress-ng: Matrix Mathstress-ng: System V Message Passingstress-ng: Vector Floating Pointstress-ng: Socket Activitystress-ng: Glibc C String Functionsstress-ng: Mutexstress-ng: Function Callstress-ng: Cryptostress-ng: Memory Copyingstress-ng: Glibc Qsort Data Sortingstress-ng: Wide Vector Mathstress-ng: AVL Treestress-ng: Matrix 3D Mathstress-ng: x86_64 RdRandstress-ng: Vector Mathstress-ng: AVX-512 VNNIstress-ng: Floating Pointstress-ng: Semaphoresstress-ng: CPU Stressstress-ng: SENDFILEstress-ng: Pipestress-ng: Context Switchingliquid-dsp: 64 - 256 - 512liquid-dsp: 32 - 256 - 512liquid-dsp: 16 - 256 - 512liquid-dsp: 8 - 256 - 512liquid-dsp: 64 - 256 - 57liquid-dsp: 64 - 256 - 32liquid-dsp: 4 - 256 - 512liquid-dsp: 32 - 256 - 57liquid-dsp: 2 - 256 - 512liquid-dsp: 16 - 256 - 57liquid-dsp: 32 - 256 - 32liquid-dsp: 1 - 256 - 32liquid-dsp: 16 - 256 - 32liquid-dsp: 1 - 256 - 512liquid-dsp: 8 - 256 - 57liquid-dsp: 8 - 256 - 32liquid-dsp: 4 - 256 - 57liquid-dsp: 4 - 256 - 32liquid-dsp: 2 - 256 - 57liquid-dsp: 2 - 256 - 32liquid-dsp: 1 - 256 - 57laghos: Triple Point Problemnekrs: Kershawspecfem3d: Layered Halfspacebuild-linux-kernel: defconfigspecfem3d: Water-layered Halfspacespecfem3d: Homogeneous Halfspacespecfem3d: Mount St. Helensspecfem3d: Tomographic Modeldragonflydb: 50 - 1:10dragonflydb: 50 - 1:100blender: Fishy Cat - CPU-Onlydragonflydb: 20 - 1:10dragonflydb: 20 - 1:100kripke: dragonflydb: 10 - 1:100dragonflydb: 10 - 1:10vvenc: Bosphorus 4K - Fastsvt-av1: Preset 4 - Bosphorus 1080premhos: Sample Remap Exampleembree: Pathtracer - Asian Dragon Objoidn: RTLightmap.hdr.4096x4096 - CPU-Onlyembree: Pathtracer ISPC - Asian Dragon Objblender: BMW27 - CPU-Onlysvt-av1: Preset 4 - Bosphorus 1080pvvenc: Bosphorus 4K - Fastersvt-av1: Preset 8 - Bosphorus 4Koidn: RT.ldr_alb_nrm.3840x2160 - CPU-Onlyoidn: RT.hdr_alb_nrm.3840x2160 - CPU-Onlysvt-av1: Preset 8 - Bosphorus 4Kembree: Pathtracer - Crownembree: Pathtracer ISPC - Crownvvenc: Bosphorus 1080p - Fastembree: Pathtracer - Asian Dragonsvt-av1: Preset 8 - Bosphorus 1080psvt-av1: Preset 8 - Bosphorus 1080pembree: Pathtracer ISPC - Asian Dragonsvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Ksvt-av1: Preset 12 - Bosphorus 4Ksvt-av1: Preset 13 - Bosphorus 4Kvvenc: Bosphorus 1080p - Fastersvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 12 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080psvt-av1: Preset 13 - Bosphorus 1080pdefgh5446481655.8134.5725.6140.3426.688.1413.7528.5730.115.8415.4310.8315.4612.0111.8323.9710.845311.086890.0756176.61458.827916.9961372.72230402641.605124.715310.86344.9317202.50199.8131810.0774776708.72436.974936.384942.8337373.027919.635650.9089795.85219.81794.038719.876321.0894757.4777.1456139.7934186.648285.485411.900271.3514.013971.836213.9192150.276106.146744.9148355.887740.734524.536122.720.71448.251294.6197168.724710.081399.1222140.5767113.6578139.6127114.28214.745467.765761.8053258.559914.663268.162162.1006257.42375.99472661.22557.1228140.20147.1102140.45681.1189890.9448263.855.05318.35236.99747049000094.8870043.781170.314022496.07137476652.93912.6235640.361131.552944.5432151217.0325006.734337787.727218607.121007.46173878.76851573.2106715.859504.3840991307.95438280.1627731.16107643.4913428.64916.741543291.91410.8910869.4912013602.06234239.63667964.1511939.4192019395.1388466.54857324.8920249488.6117931974.92504660000387460000195510000964490001778600000204560000049884000123450000024900000650410000107380000035195000549390000126520003312000002717300001742800001365000001056300006883400052765000195.401152090000038.48977964935.28334.82222748218.85382887614.50564525314.87771034615253621.5317583489.3848.9515179164.8614250093.7137207200011860591.6511686474.266.76820.90437.98760.6540.572637.9314.97412.67872.3531.371.3637.67739.384119.16642.7641136.07148.4746195.829199.52333.572419.853545.195643465143.01102.96191.2356.8548.0950.4816.728.7555.5255.321.7448.5229.3343.7334.7531.8843.0446.596113.997571.4085161.19641.08924.331671.23185913772.2797163.820846.66067.3697135.642549.083250.3927348317.23584.6004216.862249.42452584.390514.588868.5055972.7478128.4447963.8636129.858524.75975158.66278.9987111.0764224.2533567.941328.171432.984930.309134.816228.7153285.1281447.14255.06422319.875622.757543.901422.2616.516660.507113.46241125.03358.1721122.24814.135168.9014755.2163166.1603767.63465.5308180.502371.89821776.2195.4651182.801772.05411773.16729.016214145.43097.2759137.35997.2672137.52851.6625600.57444.5814.26205.0917.1654645.351161.562781554.43712254498.06120.8529901.534118.1916620.95179841973.66153882.1921326879.3645096278.1330826.86989961.327717311.24605694.4111104.17215179957.06363805.32159228.55464310.6672762.674849.68239176.311473.1513265.6778670488.151287349.0220114751.5671642.41866657136.27517736.813678593.69157840183.6379639504.69659610000330490000164350000817150002066300000182710000041506000107110000020808000534940000916020000291900004569800001055300027689000022866000015093000011383000088038000572840004411300022.77410.059.731184.1061.50187.00517.4413.18573.6243.183.2493.281189.4439189.1949205.7092123.523133.868227.0315169.932188.744169.887195.357490.984495.83594.839602.213635368244.0798.31197.260.5250.3955.2616.5430.3161.0156.1522.1248.7330.9847.0934.2930.742.5446.594814.067571.0545160.91344.146522.646671.04198090752.5783168.092446.55997.4283134.568448.900650.0857352745.53585.3364215.658849.26172593.454216.637760.0758969.1736129.1324972.5049128.972924.6525181.35548.8905112.4238222.9581570.633128.268432.717830.555634.537728.9472285.5189446.499455.65252295.607522.834643.754522.6416.155761.8596113.24041127.39138.9799111.26434.179169.0466755.3461166.5217765.85235.5536179.777471.9811775.03945.4849182.13671.97941774.43848.372515229.27117.2453137.93857.2233138.36231.6716597.37034.64614.09204.5617.2154748.851158.862722241.67714509917.7116.4429475.622268.3416624.07183254707.86154301.8421392880.4145263381.8430923.93990014.157746308.69606558.87109282.45215059237.87363091.11159300.62457992.6272822.864847.938234696.931444.9613014.8178670864.071284285.0920046142.6771775.48775690998.4516299.643224251.1416814251296067858.09659970000330090000166500000823430002081700000182790000041015000105520000020873000553290000913640000292140004581400001056300027814000022369000014651000011436000088120000571940004414400023.4710.119.873180.13761.47184.23927.5213.44571.7123.073.2593.657185.3561189.3153206.166124.81134.062224.8996196.319161.816183.466202.493503.076483.011588.809602.993639840444.27108.73202.3458.6648.0853.6215.129.2656.3457.7226.947.6729.9257.6735.9631.2441.2146.605413.847372.184161.16942.038823.782171.97185134756.6198166.733546.65677.3831135.395549.512350.4014334039.11573.3582220.906948.88412613.08814.415969.3271969.8849129.3849969.2352129.438524.79475150.42528.7387114.3784224.6455566.725928.352130.223533.077931.451931.7865256.1642497.897653.11562405.875520.598148.498922.415.022366.5223113.01911128.88468.0302124.40724.124167.4236761.8657165.5104770.50655.4661182.641771.82141778.36165.4478183.371771.76191780.23798.584314854.8567.2111138.5947.2026138.75841.6492605.46724.61514.38204.4417.3154906.431139.872951631.96719479964.3397.9429932.23664.3416624.07180789739.08154267.3421346079.345298625.9131064.95989553.87729966.17604724.84110254.19215429087.61364035.57156381.63462528.7372825.564846.228237581.371564.4214658.7578673620.141288008.2620133541.1171782.95800762640.11523898.453761786.29173499514.94110432440.32659140000331240000165180000830900002076600000182760000041641000105730000020660000542300000914570000292190004578200001057600027984000022852000015101000011455000088109000557080004418700023.05810.029.831193.05021.50193.42977.5113.57575.153.283.4092.641183.4023193.6041217.591125.22134.875239.4187179.405191.665189.164203.007490.154511.795600.23599.573627057742.393.15201.5958.4148.8254.4717.3929.5658.8555.3623.6348.7629.5545.2237.5630.8944.1446.552812.828677.9169160.8735.850827.884971.39190955751.7219168.750646.75627.4589134.015950.087950.9618361173.71567.855222.858648.51072633.59213.96571.5654954.9445131.1013952.8689131.543224.67775175.48638.6016116.1927223.8503569.320828.253730.365132.922630.723132.5406239.1915532.459452.5772429.741819.744650.597522.3514.564968.6123113.20271127.79257.6734130.19294.204166.7561764.7374164.6605774.40545.4481183.236571.53551786.17565.4268184.092171.58151784.35599.697413153.49587.1163140.44037.1792139.21451.6419608.16114.59314.46203.4917.1255328.711209.532880825.11725705488.15115.8329481.453749.4716625.57181626937.85154293.3320251576.3345265797.3430591.67990104.857761412.61605340.53113347.77215308424.81363892.1158994.25461785.3672800.744850.998234456.231628.9115802.8678690548.491285121.420123210.1471790.06786069338.66518600.853745995.16159491222.34110991931.6652820000329210000165390000827490002072000000182210000041052000106500000020837000546690000911870000292000004574200001056700027725000022809000015536000011439000088042000559230004418300023.04610.029.901172.20681.54184.45377.412.91373.253.463.5394.831187.4161194.2743207.2492124.695134.38225.8305187.677177.363194.863197.03485.593507.017581.067600.762OpenBenchmarking.org

BRL-CAD

BRL-CAD is a cross-platform, open-source solid modeling system with built-in benchmark mode. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgVGR Performance Metric, More Is BetterBRL-CAD 7.36VGR Performance Metricdefgh1.4M2.8M4.2M5.6M7M54464864346516353682639840462705771. (CXX) g++ options: -std=c++14 -pipe -fvisibility=hidden -fno-strict-aliasing -fno-common -fexceptions -ftemplate-depth-128 -m64 -ggdb3 -O3 -fipa-pta -fstrength-reduce -finline-functions -flto -ltcl8.6 -lregex_brl -lz_brl -lnetpbm -ldl -lm -ltk8.6

NCNN

NCNN is a high performance neural network inference framework optimized for mobile and other platforms developed by Tencent. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: FastestDetdefgh102030405016.0043.0144.0744.2742.30MIN: 15.86 / MAX: 20.49MIN: 41.43 / MAX: 170.26MIN: 42.4 / MAX: 239.24MIN: 40.55 / MAX: 822.5MIN: 41.82 / MAX: 83.331. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vision_transformerdefgh2040608010055.05102.9698.31108.7393.15MIN: 54.24 / MAX: 72.37MIN: 93.93 / MAX: 746.86MIN: 93.92 / MAX: 519.01MIN: 94.5 / MAX: 1869.82MIN: 88.26 / MAX: 206.511. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: regnety_400mdefgh408012016020033.50191.23197.20202.34201.59MIN: 33.26 / MAX: 37.92MIN: 183.47 / MAX: 798.79MIN: 189.74 / MAX: 813.26MIN: 187.7 / MAX: 1161.84MIN: 191.08 / MAX: 1525.041. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: squeezenet_ssddefgh142842567024.2256.8560.5258.6658.41MIN: 23.8 / MAX: 28.97MIN: 55.3 / MAX: 146.7MIN: 59.42 / MAX: 125.02MIN: 55.23 / MAX: 333.73MIN: 57.55 / MAX: 98.81. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: yolov4-tinydefgh112233445535.7548.0950.3948.0848.82MIN: 34.32 / MAX: 45.5MIN: 44.42 / MAX: 108.3MIN: 44.47 / MAX: 465.37MIN: 42.94 / MAX: 393.57MIN: 45.36 / MAX: 93.881. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet50defgh122436486026.5350.4855.2653.6254.47MIN: 26.13 / MAX: 30.95MIN: 49.07 / MAX: 131.05MIN: 53.64 / MAX: 170.04MIN: 51.72 / MAX: 288.16MIN: 51.64 / MAX: 145.971. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: alexnetdefgh481216208.1316.7016.5415.1017.39MIN: 7.95 / MAX: 8.44MIN: 15.23 / MAX: 164.75MIN: 15.67 / MAX: 44.81MIN: 14.22 / MAX: 74.43MIN: 14.86 / MAX: 374.111. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: resnet18defgh71421283513.6528.7530.3129.2629.56MIN: 13.29 / MAX: 15.11MIN: 27.58 / MAX: 199.58MIN: 29.56 / MAX: 32.56MIN: 26.62 / MAX: 124.57MIN: 27.99 / MAX: 114.191. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: vgg16defgh142842567028.6955.5261.0156.3458.85MIN: 28.07 / MAX: 37.47MIN: 50.63 / MAX: 278.46MIN: 55.16 / MAX: 391.41MIN: 50.98 / MAX: 172.01MIN: 55.68 / MAX: 98.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: googlenetdefgh132639526529.0355.3056.1557.7255.36MIN: 27.97 / MAX: 36.49MIN: 52.38 / MAX: 300.85MIN: 53.89 / MAX: 379.5MIN: 52.79 / MAX: 429.12MIN: 54.42 / MAX: 115.961. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: blazefacedefgh6121824305.6321.7422.1226.9023.63MIN: 5.53 / MAX: 5.95MIN: 21.34 / MAX: 78.44MIN: 21.92 / MAX: 24.06MIN: 21.88 / MAX: 899.43MIN: 21.98 / MAX: 379.781. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: efficientnet-b0defgh112233445514.7648.5248.7347.6748.76MIN: 14.47 / MAX: 29.12MIN: 46.31 / MAX: 227.29MIN: 45.75 / MAX: 396.94MIN: 47.05 / MAX: 112.46MIN: 47.17 / MAX: 210.731. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mnasnetdefgh71421283510.7029.3330.9829.9229.55MIN: 10.22 / MAX: 14.66MIN: 27.57 / MAX: 174.05MIN: 27.55 / MAX: 508.86MIN: 27.82 / MAX: 234.43MIN: 28.64 / MAX: 97.11. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: shufflenet-v2defgh132639526513.9643.7347.0957.6745.22MIN: 13.65 / MAX: 18.71MIN: 40.38 / MAX: 347.86MIN: 41.43 / MAX: 934.54MIN: 40.74 / MAX: 1797.58MIN: 40.32 / MAX: 296.921. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v3-v3 - Model: mobilenet-v3defgh91827364511.6434.7534.2935.9637.56MIN: 11.48 / MAX: 15.48MIN: 33.3 / MAX: 84.81MIN: 33.52 / MAX: 74.33MIN: 34.15 / MAX: 266.64MIN: 33.74 / MAX: 351.291. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU-v2-v2 - Model: mobilenet-v2defgh71421283511.2931.8830.7031.2430.89MIN: 10.83 / MAX: 11.91MIN: 29.92 / MAX: 138.47MIN: 29.73 / MAX: 58.64MIN: 30.27 / MAX: 71.7MIN: 29.59 / MAX: 58.991. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OpenBenchmarking.orgms, Fewer Is BetterNCNN 20230517Target: CPU - Model: mobilenetdefgh102030405023.5543.0442.5441.2144.14MIN: 23.18 / MAX: 27.8MIN: 41.85 / MAX: 91.85MIN: 41.2 / MAX: 128.05MIN: 40.54 / MAX: 69.23MIN: 42.79 / MAX: 118.071. (CXX) g++ options: -O3 -rdynamic -lgomp -lpthread

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/scivis/real_timedefgh112233445510.8546.6046.5946.6146.55

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh4812162011.0914.0014.0713.8512.83

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh2040608010090.0871.4171.0572.1877.92

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/pathtracer/real_timedefgh4080120160200176.61161.20160.91161.17160.87

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Streamdefgh132639526558.8341.0944.1542.0435.85

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Synchronous Single-Streamdefgh71421283517.0024.3322.6523.7827.88

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

Build: allmodconfig

d: The test quit with a non-zero exit status.

e: The test quit with a non-zero exit status.

f: The test quit with a non-zero exit status.

g: The test quit with a non-zero exit status.

h: The test quit with a non-zero exit status.

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Barbershop - Compute: CPU-Onlydefgh80160240320400372.7271.2371.0471.9771.39

Apache Cassandra

This is a benchmark of the Apache Cassandra NoSQL database management system making use of cassandra-stress. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOp/s, More Is BetterApache Cassandra 4.1.3Test: Writesdefgh50K100K150K200K250K230402185913198090185134190955

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Streamdefgh170340510680850641.61772.28752.58756.62751.72

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Streamdefgh408012016020024.72163.82168.09166.73168.75

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: particle_volume/ao/real_timedefgh112233445510.8646.6646.5646.6646.76

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh2468104.93177.36977.42837.38317.4589

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh4080120160200202.50135.64134.57135.40134.02

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/scivis/real_timedefgh11223344559.8131849.0832048.9006049.5123050.08790

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/ao/real_timedefgh112233445510.0850.3950.0950.4050.96

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU Cachedefgh170K340K510K680K850K776708.72348317.23352745.53334039.11361173.711. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Streamdefgh130260390520650436.97584.60585.34573.36567.86

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Streamdefgh5010015020025036.38216.86215.66220.91222.86

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh112233445542.8349.4249.2648.8848.51

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh6001200180024003000373.032584.392593.452613.092633.59

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Streamdefgh51015202519.6414.5916.6414.4213.97

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Synchronous Single-Streamdefgh163248648050.9168.5160.0869.3371.57

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Streamdefgh2004006008001000795.85972.75969.17969.88954.94

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Streamdefgh30609012015019.81128.44129.13129.38131.10

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Streamdefgh2004006008001000794.04963.86972.50969.24952.87

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Streamdefgh30609012015019.88129.86128.97129.44131.54

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh61218243021.0924.7624.6524.7924.68

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh11002200330044005500757.485158.665181.365150.435175.49

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Streamdefgh36912157.14568.99878.89058.73878.6016

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Synchronous Single-Streamdefgh306090120150139.79111.08112.42114.38116.19

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Streamdefgh50100150200250186.65224.25222.96224.65223.85

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, BERT base uncased SST2 - Scenario: Asynchronous Multi-Streamdefgh12024036048060085.49567.94570.63566.73569.32

OSPRay

Intel OSPRay is a portable ray-tracing engine for high-performance, high-fidelity scientific visualizations. OSPRay builds off Intel's Embree and Intel SPMD Program Compiler (ISPC) components as part of the oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgItems Per Second, More Is BetterOSPRay 2.12Benchmark: gravity_spheres_volume/dim_512/pathtracer/real_timedefgh71421283511.9028.1728.2728.3528.25

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Streamdefgh163248648071.3532.9832.7230.2230.37

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Synchronous Single-Streamdefgh81624324014.0130.3130.5633.0832.92

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Streamdefgh163248648071.8434.8234.5431.4530.72

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Synchronous Single-Streamdefgh81624324013.9228.7228.9531.7932.54

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Streamdefgh60120180240300150.28285.13285.52256.16239.19

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Asynchronous Multi-Streamdefgh120240360480600106.15447.14446.50497.90532.46

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Streamdefgh132639526544.9155.0655.6553.1252.58

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Scenario: Asynchronous Multi-Streamdefgh5001000150020002500355.892319.882295.612405.882429.74

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Streamdefgh91827364540.7322.7622.8320.6019.74

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Synchronous Single-Streamdefgh112233445524.5443.9043.7548.5050.60

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Pabellon Barcelona - Compute: CPU-Onlydefgh306090120150122.7022.2622.6422.4022.35

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Streamdefgh51015202520.7116.5216.1615.0214.56

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Scenario: Synchronous Single-Streamdefgh153045607548.2560.5161.8666.5268.61

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Streamdefgh30609012015094.62113.46113.24113.02113.20

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Streamdefgh2004006008001000168.721125.031127.391128.881127.79

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Streamdefgh369121510.08138.17218.97998.03027.6734

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: NLP Text Classification, DistilBERT mnli - Scenario: Synchronous Single-Streamdefgh30609012015099.12122.25111.26124.41130.19

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 4 - Input: Bosphorus 4Kefgh0.94591.89182.83773.78364.72954.1354.1794.1244.2041. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Neural Magic DeepSparse

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Streamdefgh4080120160200140.58168.90169.05167.42166.76

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Streamdefgh160320480640800113.66755.22755.35761.87764.74

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh4080120160200139.61166.16166.52165.51164.66

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh170340510680850114.28767.63765.85770.51774.41

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Streamdefgh4812162014.74545.53085.55365.46615.4481

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO - Scenario: Synchronous Single-Streamdefgh408012016020067.77180.50179.78182.64183.24

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Streamdefgh163248648061.8171.9071.9871.8271.54

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Streamdefgh400800120016002000258.561776.221775.041778.361786.18

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh4812162014.66325.46515.48495.44785.4268

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh408012016020068.16182.80182.14183.37184.09

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Streamdefgh163248648062.1072.0571.9871.7671.58

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Streamdefgh400800120016002000257.421773.171774.441780.241784.36

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh36912155.99479.01628.37258.58439.6974

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Streamdefgh3K6K9K12K15K2661.2314145.4315229.2714854.8613153.50

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Streamdefgh2468107.12287.27597.24537.21117.1163

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: CV Classification, ResNet-50 ImageNet - Scenario: Synchronous Single-Streamdefgh306090120150140.20137.36137.94138.59140.44

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Synchronous Single-Streamdefgh2468107.11027.26727.22337.20267.1792

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Baseline - Scenario: Synchronous Single-Streamdefgh306090120150140.46137.53138.36138.76139.21

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh0.37610.75221.12831.50441.88051.11891.66251.67161.64921.6419

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.5Model: ResNet-50, Sparse INT8 - Scenario: Synchronous Single-Streamdefgh2004006008001000890.94600.57597.37605.47608.16

Laghos

Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Sedov Blast Wave, ube_922_hex.meshd60120180240300263.851. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Test: Sedov Blast Wave, ube_922_hex.mesh

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2406354] pml_ucx.c:419 Error: ucp_ep_create(proc=129) failed: Destination is unreachable

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2763172] pml_ucx.c:419 Error: ucp_ep_create(proc=60) failed: Endpoint timeout

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1783809] pml_ucx.c:419 Error: ucp_ep_create(proc=153) failed: Input/output error

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1412464] pml_ucx.c:419 Error: ucp_ep_create(proc=82) failed: Destination is unreachable

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 4 - Input: Bosphorus 4Kdefgh1.13692.27383.41074.54765.68455.0534.5804.6464.6154.5931. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: NUMAdefgh51015202518.3514.2614.0914.3814.461. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Atomicdefgh50100150200250236.99205.09204.56204.44203.491. (CXX) g++ options: -O2 -std=gnu99 -lc

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: TurboPipe Periodicd1600M3200M4800M6400M8000M74704900001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: TurboPipe Periodic

e: The test quit with a non-zero exit status. E: mpi_errors_are_fatal unknown handle

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2773445] pml_ucx.c:419 Error: ucp_ep_create(proc=100) failed: Destination is unreachable

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1794008] pml_ucx.c:419 Error: ucp_ep_create(proc=37) failed: Input/output error

h: The test quit with a non-zero exit status. E: mpi_errors_are_fatal unknown handle

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Classroom - Compute: CPU-Onlydefgh2040608010094.8817.1617.2117.3117.12

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Pthreaddefgh15K30K45K60K75K70043.7854645.3554748.8554906.4355328.711. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Cloningdefgh300600900120015001170.311161.561158.861139.871209.531. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Futexdefgh900K1800K2700K3600K4500K4022496.072781554.432722241.672951631.962880825.111. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mallocdefgh160M320M480M640M800M137476652.93712254498.06714509917.70719479964.33725705488.151. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MEMFDdefgh2004006008001000912.62120.85116.4497.94115.831. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mixed Schedulerdefgh8K16K24K32K40K35640.3629901.5329475.6229932.2029481.451. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MMAPdefgh90018002700360045001131.554118.192268.343664.343749.471. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Zlibdefgh4K8K12K16K20K2944.5416620.9516624.0716624.0716625.571. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Fused Multiply-Adddefgh40M80M120M160M200M32151217.03179841973.66183254707.86180789739.08181626937.851. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Shuffledefgh30K60K90K120K150K25006.73153882.19154301.84154267.34154293.331. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Polldefgh5M10M15M20M25M4337787.7221326879.3621392880.4121346079.3020251576.331. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Hashdefgh10M20M30M40M50M7218607.1245096278.1345263381.8445298625.9145265797.341. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Forkingdefgh7K14K21K28K35K1007.4630826.8630923.9331064.9530591.671. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix Mathdefgh200K400K600K800K1000K173878.70989961.32990014.15989553.80990104.851. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: System V Message Passingdefgh1.7M3.4M5.1M6.8M8.5M6851573.207717311.247746308.697729966.177761412.611. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Floating Pointdefgh130K260K390K520K650K106715.85605694.40606558.87604724.84605340.531. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Socket Activitydefgh20K40K60K80K100K9504.38111104.17109282.45110254.19113347.771. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Glibc C String Functionsdefgh50M100M150M200M250M40991307.95215179957.06215059237.87215429087.61215308424.811. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mutexdefgh90K180K270K360K450K438280.16363805.32363091.11364035.57363892.101. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Function Calldefgh30K60K90K120K150K27731.16159228.55159300.62156381.63158994.251. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Cryptodefgh100K200K300K400K500K107643.49464310.66457992.62462528.73461785.361. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Memory Copyingdefgh16K32K48K64K80K13428.6472762.6772822.8672825.5672800.741. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Glibc Qsort Data Sortingdefgh10002000300040005000916.744849.604847.934846.224850.991. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Wide Vector Mathdefgh2M4M6M8M10M1543291.918239176.318234696.938237581.378234456.231. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVL Treedefgh30060090012001500410.891473.151444.961564.421628.911. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix 3D Mathdefgh3K6K9K12K15K10869.4913265.6713014.8114658.7515802.861. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: x86_64 RdRanddefgh20M40M60M80M100M12013602.0678670488.1578670864.0778673620.1478690548.491. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Mathdefgh300K600K900K1200K1500K234239.601287349.021284285.091288008.261285121.401. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVX-512 VNNIdefgh4M8M12M16M20M3667964.1520114751.5620046142.6720133541.1120123210.141. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Floating Pointdefgh15K30K45K60K75K11939.4171642.4171775.4871782.9571790.061. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Semaphoresdefgh200M400M600M800M1000M92019395.13866657136.27775690998.40800762640.11786069338.661. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU Stressdefgh110K220K330K440K550K88466.54517736.81516299.64523898.45518600.851. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: SENDFILEdefgh800K1600K2400K3200K4000K857324.893678593.693224251.143761786.293745995.161. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Pipedefgh40M80M120M160M200M20249488.61157840183.63168142512.00173499514.94159491222.341. (CXX) g++ options: -O2 -std=gnu99 -lc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Context Switchingdefgh20M40M60M80M100M17931974.9279639504.6996067858.09110432440.32110991931.601. (CXX) g++ options: -O2 -std=gnu99 -lc

Liquid-DSP

LiquidSDR's Liquid-DSP is a software-defined radio (SDR) digital signal processing library. This test profile runs a multi-threaded benchmark of this SDR/DSP library focused on embedded platform usage. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 512defgh140M280M420M560M700M5046600006596100006599700006591400006528200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 512defgh80M160M240M320M400M3874600003304900003300900003312400003292100001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 512defgh40M80M120M160M200M1955100001643500001665000001651800001653900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 8 - Buffer Length: 256 - Filter Length: 512defgh20M40M60M80M100M96449000817150008234300083090000827490001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 57defgh400M800M1200M1600M2000M177860000020663000002081700000207660000020720000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 64 - Buffer Length: 256 - Filter Length: 32defgh400M800M1200M1600M2000M204560000018271000001827900000182760000018221000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 4 - Buffer Length: 256 - Filter Length: 512defgh11M22M33M44M55M49884000415060004101500041641000410520001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 57defgh300M600M900M1200M1500M123450000010711000001055200000105730000010650000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 2 - Buffer Length: 256 - Filter Length: 512defgh5M10M15M20M25M24900000208080002087300020660000208370001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 57defgh140M280M420M560M700M6504100005349400005532900005423000005466900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 32 - Buffer Length: 256 - Filter Length: 32defgh200M400M600M800M1000M10738000009160200009136400009145700009118700001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 32defgh8M16M24M32M40M35195000291900002921400029219000292000001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 16 - Buffer Length: 256 - Filter Length: 32defgh120M240M360M480M600M5493900004569800004581400004578200004574200001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 512defgh3M6M9M12M15M12652000105530001056300010576000105670001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 8 - Buffer Length: 256 - Filter Length: 57defgh70M140M210M280M350M3312000002768900002781400002798400002772500001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 8 - Buffer Length: 256 - Filter Length: 32defgh60M120M180M240M300M2717300002286600002236900002285200002280900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 4 - Buffer Length: 256 - Filter Length: 57defgh40M80M120M160M200M1742800001509300001465100001510100001553600001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 4 - Buffer Length: 256 - Filter Length: 32defgh30M60M90M120M150M1365000001138300001143600001145500001143900001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 2 - Buffer Length: 256 - Filter Length: 57defgh20M40M60M80M100M105630000880380008812000088109000880420001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 2 - Buffer Length: 256 - Filter Length: 32defgh15M30M45M60M75M68834000572840005719400055708000559230001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

OpenBenchmarking.orgsamples/s, More Is BetterLiquid-DSP 1.6Threads: 1 - Buffer Length: 256 - Filter Length: 57defgh11M22M33M44M55M52765000441130004414400044187000441830001. (CC) gcc options: -O3 -pthread -lm -lc -lliquid

Laghos

Laghos (LAGrangian High-Order Solver) is a miniapp that solves the time-dependent Euler equations of compressible gas dynamics in a moving Lagrangian frame using unstructured high-order finite element spatial discretization and explicit high-order time-stepping. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgMajor Kernels Total Rate, More Is BetterLaghos 3.1Test: Triple Point Problemd4080120160200195.401. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Test: Triple Point Problem

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2404677] pml_ucx.c:419 Error: ucp_ep_create(proc=86) failed: Destination is unreachable

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2761984] pml_ucx.c:419 Error: ucp_ep_create(proc=100) failed: Endpoint timeout

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1782157] pml_ucx.c:419 Error: ucp_ep_create(proc=13) failed: Destination is unreachable

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1411089] pml_ucx.c:419 Error: ucp_ep_create(proc=62) failed: Endpoint timeout

nekRS

nekRS is an open-source Navier Stokes solver based on the spectral element method. NekRS supports both CPU and GPU/accelerator support though this test profile is currently configured for CPU execution. NekRS is part of Nek5000 of the Mathematics and Computer Science MCS at Argonne National Laboratory. This nekRS benchmark is primarily relevant to large core count HPC servers and otherwise may be very time consuming on smaller systems. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgflops/rank, More Is BetternekRS 23.0Input: Kershawd2000M4000M6000M8000M10000M115209000001. (CXX) g++ options: -fopenmp -O2 -march=native -mtune=native -ftree-vectorize -rdynamic -lmpi_cxx -lmpi

Input: Kershaw

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2415260] pml_ucx.c:419 Error: ucp_ep_create(proc=53) failed: Input/output error

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2772472] pml_ucx.c:419 Error: ucp_ep_create(proc=103) failed: Destination is unreachable

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1792976] pml_ucx.c:419 Error: ucp_ep_create(proc=131) failed: Destination is unreachable

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1421720] pml_ucx.c:419 Error: ucp_ep_create(proc=97) failed: Endpoint timeout

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Layered Halfspaced91827364538.491. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Model: Layered Halfspace

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2410113] pml_ucx.c:419 Error: ucp_ep_create(proc=74) failed: Destination is unreachable

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2767075] pml_ucx.c:419 Error: ucp_ep_create(proc=57) failed: Destination is unreachable

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1787630] pml_ucx.c:419 Error: ucp_ep_create(proc=111) failed: Input/output error

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1416584] pml_ucx.c:419 Error: ucp_ep_create(proc=154) failed: Input/output error

Timed Linux Kernel Compilation

This test times how long it takes to build the Linux kernel in a default configuration (defconfig) for the architecture being tested or alternatively an allmodconfig for building all possible kernel modules for the build. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfigdefgh81624324035.2822.7723.4723.0623.05

SPECFEM3D

simulates acoustic (fluid), elastic (solid), coupled acoustic/elastic, poroelastic or seismic wave propagation in any type of conforming mesh of hexahedra. This test profile currently relies on CPU-based execution for SPECFEM3D and using a variety of their built-in examples/models for benchmarking. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Water-layered Halfspaced81624324034.821. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Model: Water-layered Halfspace

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2413958] pml_ucx.c:419 Error: ucp_ep_create(proc=40) failed: Endpoint timeout

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2771078] pml_ucx.c:419 Error: ucp_ep_create(proc=73) failed: Input/output error

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1791559] pml_ucx.c:419 Error: ucp_ep_create(proc=100) failed: Input/output error

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1420136] pml_ucx.c:419 Error: ucp_ep_create(proc=93) failed: Destination is unreachable

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Homogeneous Halfspaced51015202518.851. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Model: Homogeneous Halfspace

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2412818] pml_ucx.c:419 Error: ucp_ep_create(proc=84) failed: Destination is unreachable

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2769536] pml_ucx.c:419 Error: ucp_ep_create(proc=248) failed: Input/output error

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1790169] pml_ucx.c:419 Error: ucp_ep_create(proc=60) failed: Destination is unreachable

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1419043] pml_ucx.c:419 Error: ucp_ep_create(proc=43) failed: Destination is unreachable

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Mount St. Helensd4812162014.511. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Model: Mount St. Helens

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2408581] pml_ucx.c:419 Error: ucp_ep_create(proc=0) failed: Destination is unreachable

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2765844] pml_ucx.c:419 Error: ucp_ep_create(proc=101) failed: Destination is unreachable

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1786040] pml_ucx.c:419 Error: ucp_ep_create(proc=17) failed: Input/output error

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1415047] pml_ucx.c:419 Error: ucp_ep_create(proc=80) failed: Endpoint timeout

OpenBenchmarking.orgSeconds, Fewer Is BetterSPECFEM3D 4.0Model: Tomographic Modeld4812162014.881. (F9X) gfortran options: -O2 -fopenmp -std=f2003 -fimplicit-none -fmax-errors=10 -pedantic -pedantic-errors -O3 -finline-functions -lmpi_usempif08 -lmpi_mpifh -lmpi

Model: Tomographic Model

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2411457] pml_ucx.c:419 Error: ucp_ep_create(proc=83) failed: Destination is unreachable

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2768548] pml_ucx.c:419 Error: ucp_ep_create(proc=119) failed: Destination is unreachable

g: The test quit with a non-zero exit status. E: Par_file_faults not found: assuming that there are no faults

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1417740] pml_ucx.c:419 Error: ucp_ep_create(proc=82) failed: Input/output error

Dragonflydb

Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:10d3M6M9M12M15M15253621.531. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Clients Per Thread: 50 - Set To Get Ratio: 1:10

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 50 - Set To Get Ratio: 1:100d4M8M12M16M20M17583489.381. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Clients Per Thread: 50 - Set To Get Ratio: 1:100

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: Fishy Cat - Compute: CPU-Onlydefgh112233445548.9510.0510.1110.0210.02

Dragonflydb

Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:10d3M6M9M12M15M15179164.861. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Clients Per Thread: 20 - Set To Get Ratio: 1:10

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 20 - Set To Get Ratio: 1:100d3M6M9M12M15M14250093.711. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Clients Per Thread: 20 - Set To Get Ratio: 1:100

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

Kripke

Kripke is a simple, scalable, 3D Sn deterministic particle transport code. Its primary purpose is to research how data layout, programming paradigms and architectures effect the implementation and performance of Sn transport. Kripke is developed by LLNL. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgThroughput FoM, More Is BetterKripke 1.2.6d80M160M240M320M400M3720720001. (CXX) g++ options: -O3 -fopenmp -ldl

e: The test quit with a non-zero exit status. E: [1693391489.513383] [amd-test-01:1711180:0] sock.c:325 UCX ERROR connect(fd=65, dest_addr=127.0.0.1:48801) failed: Connection refused

f: The test quit with a non-zero exit status. E: [1693366512.651572] [amd-test-01:2311490:0] sock.c:325 UCX ERROR connect(fd=39, dest_addr=127.0.0.1:49861) failed: Connection refused

g: The test quit with a non-zero exit status. E: [1693403940.471431] [amd-test-01:1342351:0] sock.c:325 UCX ERROR connect(fd=48, dest_addr=127.0.0.1:44899) failed: Connection refused

h: The test quit with a non-zero exit status. E: 7.469567] [amd-test-01:928103:0] sock.c:325 UCX ERROR connect(fd=44, dest_addr=127.0.0.1:45965) failed: Connection refused

Dragonflydb

Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:100d3M6M9M12M15M11860591.651. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Clients Per Thread: 10 - Set To Get Ratio: 1:100

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

OpenBenchmarking.orgOps/sec, More Is BetterDragonflydb 1.6.2Clients Per Thread: 10 - Set To Get Ratio: 1:10d3M6M9M12M15M11686474.261. (CXX) g++ options: -O2 -levent_openssl -levent -lcrypto -lssl -lpthread -lz -lpcre

Clients Per Thread: 10 - Set To Get Ratio: 1:10

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fastd2468106.7681. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Video Input: Bosphorus 4K - Video Preset: Fast

e: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

f: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

g: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

h: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 4 - Input: Bosphorus 1080pefgh36912159.7319.8739.8319.9011. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Remhos

Remhos (REMap High-Order Solver) is a miniapp that solves the pure advection equations that are used to perform monotonic and conservative discontinuous field interpolation (remap) as part of the Eulerian phase in Arbitrary Lagrangian Eulerian (ALE) simulations. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgSeconds, Fewer Is BetterRemhos 1.0Test: Sample Remap Exampled51015202520.901. (CXX) g++ options: -O3 -std=c++11 -lmfem -lHYPRE -lmetis -lrt -lmpi_cxx -lmpi

Test: Sample Remap Example

e: The test quit with a non-zero exit status. E: [amd-test-01.local:2407743] pml_ucx.c:419 Error: ucp_ep_create(proc=165) failed: Endpoint timeout

f: The test quit with a non-zero exit status. E: [amd-test-01.local:2764399] pml_ucx.c:419 Error: ucp_ep_create(proc=45) failed: Destination is unreachable

g: The test quit with a non-zero exit status. E: [amd-test-01.local:1784682] 3 more processes have sent help message help-mpi-errors.txt / mpi_errors_are_fatal unknown handle

h: The test quit with a non-zero exit status. E: [amd-test-01.local:1413726] pml_ucx.c:419 Error: ucp_ep_create(proc=79) failed: Endpoint timeout

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragon Objdefgh408012016020037.99184.11180.14193.05172.21MIN: 37.71 / MAX: 38.4MIN: 161.71 / MAX: 195.69MIN: 156.79 / MAX: 192.08MIN: 174.03 / MAX: 205.21MIN: 150.53 / MAX: 183.32

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RTLightmap.hdr.4096x4096 - Device: CPU-Onlydefgh0.34650.6931.03951.3861.73250.651.501.471.501.54

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragon Objdefgh408012016020040.57187.01184.24193.43184.45MIN: 40.28 / MAX: 41.19MIN: 157.55 / MAX: 203.21MIN: 158.98 / MAX: 198.61MIN: 170.24 / MAX: 203.63MIN: 159.04 / MAX: 199.2

Blender

OpenBenchmarking.orgSeconds, Fewer Is BetterBlender 3.6Blend File: BMW27 - Compute: CPU-Onlydefgh91827364537.937.447.527.517.40

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 4 - Input: Bosphorus 1080pdefgh4812162014.9713.1913.4513.5812.911. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 4K - Video Preset: Fasterd369121512.681. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Video Input: Bosphorus 4K - Video Preset: Faster

e: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

f: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

g: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

h: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 8 - Input: Bosphorus 4Kdefgh2040608010072.3573.6271.7175.1573.251. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Intel Open Image Denoise

Open Image Denoise is a denoising library for ray-tracing and part of the Intel oneAPI rendering toolkit. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.ldr_alb_nrm.3840x2160 - Device: CPU-Onlydefgh0.77851.5572.33553.1143.89251.373.183.073.283.46

OpenBenchmarking.orgImages / Sec, More Is BetterIntel Open Image Denoise 2.0Run: RT.hdr_alb_nrm.3840x2160 - Device: CPU-Onlydefgh0.79431.58862.38293.17723.97151.363.243.253.403.53

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 8 - Input: Bosphorus 4Kefgh2040608010093.2893.6692.6494.831. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Dragonflydb

Dragonfly is an open-source database server that is a "modern Redis replacement" that aims to be the fastest memory store while being compliant with the Redis and Memcached protocols. For benchmarking Dragonfly, Memtier_benchmark is used as a NoSQL Redis/Memcache traffic generation plus benchmarking tool developed by Redis Labs. Learn more via the OpenBenchmarking.org test page.

Clients Per Thread: 60 - Set To Get Ratio: 1:100

d: The test run did not produce a result. E: Connection error: Connection reset by peer

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

Clients Per Thread: 60 - Set To Get Ratio: 1:10

d: The test run did not produce a result. E: Connection error: Connection reset by peer

e: The test run did not produce a result. E: Connection error: Connection refused

f: The test run did not produce a result. E: Connection error: Connection refused

g: The test run did not produce a result. E: Connection error: Connection refused

h: The test run did not produce a result. E: Connection error: Connection refused

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Crowndefgh408012016020037.68189.44185.36183.40187.42MIN: 37.26 / MAX: 39.44MIN: 184.47 / MAX: 200.52MIN: 180.8 / MAX: 195.2MIN: 179.05 / MAX: 191.46MIN: 183.21 / MAX: 193.83

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Crowndefgh408012016020039.38189.19189.32193.60194.27MIN: 38.86 / MAX: 40.97MIN: 184.06 / MAX: 202.84MIN: 183.01 / MAX: 202.71MIN: 188.25 / MAX: 201.2MIN: 188.82 / MAX: 203.12

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fastd51015202519.171. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Video Input: Bosphorus 1080p - Video Preset: Fast

e: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

f: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

g: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

h: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer - Model: Asian Dragondefgh5010015020025042.76205.71206.17217.59207.25MIN: 42.48 / MAX: 43.18MIN: 201.09 / MAX: 214.82MIN: 199.24 / MAX: 215.7MIN: 211.81 / MAX: 228.26MIN: 203.29 / MAX: 214.14

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 8 - Input: Bosphorus 1080pefgh306090120150123.52124.81125.22124.701. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 8 - Input: Bosphorus 1080pdefgh306090120150136.07133.87134.06134.88134.381. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Embree

Intel Embree is a collection of high-performance ray-tracing kernels for execution on CPUs (and GPUs via SYCL) and supporting instruction sets such as SSE, AVX, AVX2, and AVX-512. Embree also supports making use of the Intel SPMD Program Compiler (ISPC). Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterEmbree 4.1Binary: Pathtracer ISPC - Model: Asian Dragondefgh5010015020025048.47227.03224.90239.42225.83MIN: 48.1 / MAX: 49.28MIN: 217.82 / MAX: 238.94MIN: 217.06 / MAX: 234.36MIN: 232.82 / MAX: 253.03MIN: 221.04 / MAX: 232.51

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 12 - Input: Bosphorus 4Kefgh4080120160200169.93196.32179.41187.681. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 13 - Input: Bosphorus 4Kdefgh4080120160200195.83188.74161.82191.67177.361. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 12 - Input: Bosphorus 4Kdefgh4080120160200199.52169.89183.47189.16194.861. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 13 - Input: Bosphorus 4Kefgh4080120160200195.36202.49203.01197.031. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

VVenC

VVenC is the Fraunhofer Versatile Video Encoder as a fast/efficient H.266/VVC encoder. The vvenc encoder makes use of SIMD Everywhere (SIMDe). The vvenc software is published under the Clear BSD License. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterVVenC 1.9Video Input: Bosphorus 1080p - Video Preset: Fasterd81624324033.571. (CXX) g++ options: -O3 -flto -fno-fat-lto-objects -flto=auto

Video Input: Bosphorus 1080p - Video Preset: Faster

e: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

f: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

g: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

h: The test quit with a non-zero exit status. E: Parameter Check Error: Number of threads out of range (-1

SVT-AV1

This is a benchmark of the SVT-AV1 open-source video encoder/decoder. SVT-AV1 was originally developed by Intel as part of their Open Visual Cloud / Scalable Video Technology (SVT). Development of SVT-AV1 has since moved to the Alliance for Open Media as part of upstream AV1 development. SVT-AV1 is a CPU-based multi-threaded video encoder for the AV1 video format with a sample YUV video file. Learn more via the OpenBenchmarking.org test page.

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 12 - Input: Bosphorus 1080pdefgh110220330440550419.85490.98503.08490.15485.591. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

Stress-NG

Stress-NG is a Linux stress tool developed by Colin Ian King. Learn more via the OpenBenchmarking.org test page.

Test: IO_uring

d: The test run did not produce a result.

e: The test run did not produce a result.

f: The test run did not produce a result.

g: The test run did not produce a result.

h: The test run did not produce a result.

SVT-AV1

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 12 - Input: Bosphorus 1080pefgh110220330440550495.83483.01511.80507.021. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.6Encoder Mode: Preset 13 - Input: Bosphorus 1080pdefgh130260390520650545.20594.84588.81600.23581.071. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

OpenBenchmarking.orgFrames Per Second, More Is BetterSVT-AV1 1.7Encoder Mode: Preset 13 - Input: Bosphorus 1080pefgh130260390520650602.21602.99599.57600.761. (CXX) g++ options: -march=native -mno-avx -mavx2 -mavx512f -mavx512bw -mavx512dq

196 Results Shown

BRL-CAD
NCNN:
  CPU - FastestDet
  CPU - vision_transformer
  CPU - regnety_400m
  CPU - squeezenet_ssd
  CPU - yolov4-tiny
  CPU - resnet50
  CPU - alexnet
  CPU - resnet18
  CPU - vgg16
  CPU - googlenet
  CPU - blazeface
  CPU - efficientnet-b0
  CPU - mnasnet
  CPU - shufflenet-v2
  CPU-v3-v3 - mobilenet-v3
  CPU-v2-v2 - mobilenet-v2
  CPU - mobilenet
OSPRay
Neural Magic DeepSparse:
  BERT-Large, NLP Question Answering, Sparse INT8 - Synchronous Single-Stream:
    ms/batch
    items/sec
OSPRay
Neural Magic DeepSparse:
  BERT-Large, NLP Question Answering - Synchronous Single-Stream:
    ms/batch
    items/sec
Blender
Apache Cassandra
Neural Magic DeepSparse:
  BERT-Large, NLP Question Answering - Asynchronous Multi-Stream:
    ms/batch
    items/sec
OSPRay
Neural Magic DeepSparse:
  NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Synchronous Single-Stream:
    ms/batch
    items/sec
OSPRay:
  gravity_spheres_volume/dim_512/scivis/real_time
  gravity_spheres_volume/dim_512/ao/real_time
Stress-NG
Neural Magic DeepSparse:
  CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Text Classification, BERT base uncased SST2 - Synchronous Single-Stream:
    ms/batch
    items/sec
  NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Synchronous Single-Stream:
    ms/batch
    items/sec
  NLP Text Classification, BERT base uncased SST2 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
OSPRay
Neural Magic DeepSparse:
  NLP Token Classification, BERT base uncased conll2003 - Synchronous Single-Stream:
    ms/batch
    items/sec
  NLP Document Classification, oBERT base uncased on IMDB - Synchronous Single-Stream:
    ms/batch
    items/sec
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Sentiment Analysis, 80% Pruned Quantized BERT Base Uncased - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Segmentation, 90% Pruned YOLACT Pruned - Synchronous Single-Stream:
    ms/batch
    items/sec
Blender
Neural Magic DeepSparse:
  NLP Question Answering, BERT base uncased SQuaD 12layer Pruned90 - Synchronous Single-Stream:
    ms/batch
    items/sec
  NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  NLP Text Classification, DistilBERT mnli - Synchronous Single-Stream:
    ms/batch
    items/sec
SVT-AV1
Neural Magic DeepSparse:
  CV Detection, YOLOv5s COCO - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Detection, YOLOv5s COCO - Synchronous Single-Stream:
    ms/batch
    items/sec
  CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Detection, YOLOv5s COCO, Sparse INT8 - Synchronous Single-Stream:
    ms/batch
    items/sec
  ResNet-50, Baseline - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  ResNet-50, Sparse INT8 - Asynchronous Multi-Stream:
    ms/batch
    items/sec
  CV Classification, ResNet-50 ImageNet - Synchronous Single-Stream:
    ms/batch
    items/sec
  ResNet-50, Baseline - Synchronous Single-Stream:
    ms/batch
    items/sec
  ResNet-50, Sparse INT8 - Synchronous Single-Stream:
    ms/batch
    items/sec
Laghos
SVT-AV1
Stress-NG:
  NUMA
  Atomic
nekRS
Blender
Stress-NG:
  Pthread
  Cloning
  Futex
  Malloc
  MEMFD
  Mixed Scheduler
  MMAP
  Zlib
  Fused Multiply-Add
  Vector Shuffle
  Poll
  Hash
  Forking
  Matrix Math
  System V Message Passing
  Vector Floating Point
  Socket Activity
  Glibc C String Functions
  Mutex
  Function Call
  Crypto
  Memory Copying
  Glibc Qsort Data Sorting
  Wide Vector Math
  AVL Tree
  Matrix 3D Math
  x86_64 RdRand
  Vector Math
  AVX-512 VNNI
  Floating Point
  Semaphores
  CPU Stress
  SENDFILE
  Pipe
  Context Switching
Liquid-DSP:
  64 - 256 - 512
  32 - 256 - 512
  16 - 256 - 512
  8 - 256 - 512
  64 - 256 - 57
  64 - 256 - 32
  4 - 256 - 512
  32 - 256 - 57
  2 - 256 - 512
  16 - 256 - 57
  32 - 256 - 32
  1 - 256 - 32
  16 - 256 - 32
  1 - 256 - 512
  8 - 256 - 57
  8 - 256 - 32
  4 - 256 - 57
  4 - 256 - 32
  2 - 256 - 57
  2 - 256 - 32
  1 - 256 - 57
Laghos
nekRS
SPECFEM3D
Timed Linux Kernel Compilation
SPECFEM3D:
  Water-layered Halfspace
  Homogeneous Halfspace
  Mount St. Helens
  Tomographic Model
Dragonflydb:
  50 - 1:10
  50 - 1:100
Blender
Dragonflydb:
  20 - 1:10
  20 - 1:100
Kripke
Dragonflydb:
  10 - 1:100
  10 - 1:10
VVenC
SVT-AV1
Remhos
Embree
Intel Open Image Denoise
Embree
Blender
SVT-AV1
VVenC
SVT-AV1
Intel Open Image Denoise:
  RT.ldr_alb_nrm.3840x2160 - CPU-Only
  RT.hdr_alb_nrm.3840x2160 - CPU-Only
SVT-AV1
Embree:
  Pathtracer - Crown
  Pathtracer ISPC - Crown
VVenC
Embree
SVT-AV1
SVT-AV1
Embree
SVT-AV1
SVT-AV1:
  Preset 13 - Bosphorus 4K
  Preset 12 - Bosphorus 4K
SVT-AV1
VVenC
SVT-AV1
SVT-AV1
SVT-AV1
SVT-AV1