Gigabyte G242-P36 Ampere Altra Max Server

Benchmarks by Michael Larabel for a future article.

HTML result view exported from: https://openbenchmarking.org/result/2401176-NE-GIGABYTEG67&grr&sro&rro.

Gigabyte G242-P36 Ampere Altra Max ServerProcessorMotherboardChipsetMemoryDiskGraphicsMonitorNetworkOSKernelCompilerFile-SystemScreen ResolutionG242-P36gigddARMv8 Neoverse-N1 @ 3.00GHz (128 Cores)GIGABYTE G242-P36-00 MP32-AR2-00 v01000100 (F31k SCP: 2.10.20220531 BIOS)Ampere Computing LLC Altra PCI Root Complex A16 x 32 GB DDR4-3200MT/s Samsung M393A4K40DB3-CWE800GB Micron_7450_MTFDKBA800TFSASPEEDVGA HDMI2 x Intel I350Ubuntu 23.106.5.0-13-generic (aarch64)GCC 13.2.0ext41920x1080OpenBenchmarking.orgKernel Details- Transparent Huge Pages: madviseCompiler Details- --build=aarch64-linux-gnu --disable-libquadmath --disable-libquadmath-support --disable-werror --enable-bootstrap --enable-checking=release --enable-clocale=gnu --enable-default-pie --enable-fix-cortex-a53-843419 --enable-gnu-unique-object --enable-languages=c,ada,c++,go,d,fortran,objc,obj-c++,m2 --enable-libphobos-checking=release --enable-libstdcxx-debug --enable-libstdcxx-time=yes --enable-link-serialization=2 --enable-multiarch --enable-nls --enable-objc-gc=auto --enable-plugin --enable-shared --enable-threads=posix --host=aarch64-linux-gnu --program-prefix=aarch64-linux-gnu- --target=aarch64-linux-gnu --with-build-config=bootstrap-lto-lean --with-default-libstdcxx-abi=new --with-gcc-major-version-only --with-target-system-zlib=auto -v Processor Details- Scaling Governor: cppc_cpufreq performance (Boost: Disabled)Python Details- Python 3.11.6Security Details- gather_data_sampling: Not affected + itlb_multihit: Not affected + l1tf: Not affected + mds: Not affected + meltdown: Not affected + mmio_stale_data: Not affected + retbleed: Not affected + spec_rstack_overflow: Not affected + spec_store_bypass: Mitigation of SSB disabled via prctl + spectre_v1: Mitigation of __user pointer sanitization + spectre_v2: Mitigation of CSV2 BHB + srbds: Not affected + tsx_async_abort: Not affected

Gigabyte G242-P36 Ampere Altra Max Serverpytorch: CPU - 1 - Efficientnet_v2_lpytorch: CPU - 16 - ResNet-152pytorch: CPU - 16 - ResNet-50pytorch: CPU - 1 - ResNet-152pytorch: CPU - 1 - ResNet-50xmrig: Wownero - 1Mspeedb: Seq Fillxmrig: Monero - 1Mdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: ResNet-50, Sparse INT8 - Asynchronous Multi-Streambuild-llvm: Unix Makefileslczero: BLASlczero: Eigendeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamdeepsparse: CV Segmentation, 90% Pruned YOLACT Pruned - Asynchronous Multi-Streamquicksilver: CTS2build-linux-kernel: allmodconfigstress-ng: Atomicbuild-llvm: Ninjallama-cpp: llama-2-70b-chat.Q5_0.ggufstockfish: Total Timeopenssl: ChaCha20-Poly1305openssl: ChaCha20openssl: AES-256-GCMspeedb: Read While Writingrocksdb: Rand Readquicksilver: CORAL2 P2openssl: AES-128-GCMopenssl: SHA256openssl: SHA512speedb: Rand Readrocksdb: Read While Writingllama-cpp: llama-2-13b.Q4_0.ggufcachebench: Readcachebench: Read / Modify / Writecachebench: Writerocksdb: Read Rand Write Randstress-ng: Futexstress-ng: Context Switchingdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering - Asynchronous Multi-Streamamg: build-linux-kernel: defconfigdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: BERT-Large, NLP Question Answering, Sparse INT8 - Asynchronous Multi-Streamgromacs: MPI CPU - water_GMX50_barestress-ng: MEMFDspeedb: Rand Fillspeedb: Rand Fill Syncspeedb: Update Randspeedb: Read Rand Write Randrocksdb: Update Randopenssl: RSA4096openssl: RSA4096deepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Document Classification, oBERT base uncased on IMDB - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: NLP Token Classification, BERT base uncased conll2003 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO, Sparse INT8 - Asynchronous Multi-Streamquicksilver: CORAL2 P1deepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: CV Detection, YOLOv5s COCO - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: NLP Text Classification, DistilBERT mnli - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: ResNet-50, Baseline - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamdeepsparse: CV Classification, ResNet-50 ImageNet - Asynchronous Multi-Streamcompress-7zip: Decompression Ratingcompress-7zip: Compression Ratingstress-ng: IO_uringstress-ng: MMAPstress-ng: Cloningstress-ng: Mallocstress-ng: CPU Cachestress-ng: Pthreadstress-ng: Zlibstress-ng: Vector Shufflestress-ng: Vector Mathstress-ng: Wide Vector Mathstress-ng: Matrix Mathstress-ng: Function Callstress-ng: Matrix 3D Mathstress-ng: CPU Stressstress-ng: AVL Treestress-ng: Cryptostress-ng: Fused Multiply-Addstress-ng: Hashstress-ng: SENDFILEstress-ng: AVX-512 VNNIstress-ng: Glibc Qsort Data Sortingstress-ng: Vector Floating Pointstress-ng: Floating Pointstress-ng: Pollstress-ng: Glibc C String Functionsstress-ng: System V Message Passingstress-ng: Forkingstress-ng: Memory Copyingstress-ng: Semaphoresstress-ng: Mutexstress-ng: Mixed Schedulerstress-ng: NUMAstress-ng: Pipestress-ng: Socket Activityllama-cpp: llama-2-7b.Q4_0.ggufminife: Smallmt-dgemm: Sustained Floating-Point RateG242-P36gigdd0.300.671.830.681.911935.22950794201.723.50042677.0708411.52162481358.177345.678816203333308.2977.29266.3333.07188653177112213448840161732226070306487842680129050354340523552554333338268820730010132296175334478769590409571625855884513.9011438.27651645034.97615638239.9707303320337343012.7520365273.281320.135447.0250105706433378.703146.7462430.13754.588574.852849872073762722752419683431406517886.06342.855.57031137.7811830.576033.74731834.579933.6229310.8403202.227925273333314.1452200.0280185.3571339.9765132.3047476.3781132.1032477.8141537647333316604943.761088.777795.96164364343.39879814.35113551.875987.8886218.95398869.872346519.63681885.3072283.185099.8133761.08299.50252315.26151220570.5115671801.481624492.924690386.642020.18102535.3522213.547330369.9662783286.4821143237.7252250.5327153.74167637763.5937172432.6636794.331419.0630330081.1828009.0721.5823996.017.78498329005924.01252624.7719408.27159471334.543346.727316460000309.4775.64267.863.13177653916112250396400161791663040306544534870132553414505009122552000038285632826010003959375034453399030418448304851606014.0211438.66616145027.47270138251.5919243449038323012.9619654874.851326.958146.5531106013600080.078149.4774421.3454.688576.532782642044102649982518519427908518115.96345.655.42331141.4511830.717433.8691832.115433.5823310.6422202.633225810000315.8962198.9064185.8675339.5239132.3228477.0964133.4484472.0699541204333057612149.931104.197312.78164067515.18882510.28112993.155993.7486375.79398993.462355564.94682490.7572298.235082.6533765.26299.1251986.12151387869.7615654462.921624969.464691697.852022.01102553.1122219.87392099.8262867317.1621054213.7950130.9727162.14167850957.6837215286.0436309.291416.0329805509.1227959.8521.924150.718.2727528576623.42092684.8341407.1960481336.392446.499816430000310.1376.8264.7443.14226859548137855304042918132446000038279302868010132123745034448701700420437471863656314.1111438.86384745041.15485338252.628443537322318037.9320708288.981327.996246.419480.243145.2837433.8593569.362853162078912647482473336443804518085.76345.355.72221135.43651843.239633.15921850.226433.2422311.5325201.755425510000316.9118198.3147183.6437343.7639132.2627477.6899130.6575483.308541552331579583751.831092.256918.49164592319.96882225.34113379.285985.6986257.77399042.092354926.97682554.3372290.815089.1933559.87299.99251996.36151037296.4615654282.581624702.094692452.82020.3102604.7422220.77395099.6462845443.5321119614.3150686.5827159.07166379337.6737267646.9136361.291426.4530776841.7327536.7926.64OpenBenchmarking.org

PyTorch

Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_l

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 1 - Model: Efficientnet_v2_lG242-P360.06750.1350.20250.270.3375SE +/- 0.00, N = 30.30MIN: 0.27 / MAX: 0.4

PyTorch

Device: CPU - Batch Size: 16 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 16 - Model: ResNet-152G242-P360.15080.30160.45240.60320.754SE +/- 0.00, N = 20.67MIN: 0.65 / MAX: 0.7

PyTorch

Device: CPU - Batch Size: 16 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 16 - Model: ResNet-50G242-P360.41180.82361.23541.64722.059SE +/- 0.02, N = 51.83MIN: 1.7 / MAX: 2.02

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-152

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 1 - Model: ResNet-152G242-P360.1530.3060.4590.6120.765SE +/- 0.00, N = 30.68MIN: 0.65 / MAX: 0.7

PyTorch

Device: CPU - Batch Size: 1 - Model: ResNet-50

OpenBenchmarking.orgbatches/sec, More Is BetterPyTorch 2.1Device: CPU - Batch Size: 1 - Model: ResNet-50G242-P360.42980.85961.28941.71922.149SE +/- 0.00, N = 31.91MIN: 1.8 / MAX: 2.09

Xmrig

Variant: Wownero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Wownero - Hash Count: 1MG242-P36400800120016002000SE +/- 2.92, N = 31935.21. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Speedb

Test: Sequential Fill

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Sequential FillgigddG242-P3660K120K180K240K300KSE +/- 3101.60, N = 52900592857662950791. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Xmrig

Variant: Monero - Hash Count: 1M

OpenBenchmarking.orgH/s, More Is BetterXmrig 6.21Variant: Monero - Hash Count: 1MG242-P369001800270036004500SE +/- 17.55, N = 34201.71. (CXX) g++ options: -fexceptions -fno-rtti -O3 -Ofast -static-libgcc -static-libstdc++ -rdynamic -lssl -lcrypto -luv -lpthread -lrt -ldl -lhwloc

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P36612182430SE +/- 0.21, N = 324.0123.4223.50

Neural Magic DeepSparse

Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P366001200180024003000SE +/- 25.24, N = 32624.772684.832677.07

Timed LLVM Compilation

Build System: Unix Makefiles

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: Unix MakefilesgigddG242-P3690180270360450SE +/- 1.15, N = 3408.27407.19411.52

LeelaChessZero

Backend: BLAS

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.30Backend: BLASgigddG242-P361428425670SE +/- 0.58, N = 35960621. (CXX) g++ options: -flto -pthread

LeelaChessZero

Backend: Eigen

OpenBenchmarking.orgNodes Per Second, More Is BetterLeelaChessZero 0.30Backend: EigengigddG242-P361122334455SE +/- 0.33, N = 34748481. (CXX) g++ options: -flto -pthread

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamgigddG242-P3630060090012001500SE +/- 8.37, N = 31334.541336.391358.18

Neural Magic DeepSparse

Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Segmentation, 90% Pruned YOLACT Pruned - Scenario: Asynchronous Multi-StreamgigddG242-P361122334455SE +/- 0.32, N = 346.7346.5045.68

Quicksilver

Input: CTS2

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CTS2gigddG242-P364M8M12M16M20MSE +/- 42557.15, N = 31646000016430000162033331. (CXX) g++ options: -fopenmp -O3 -march=native

Timed Linux Kernel Compilation

Build: allmodconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: allmodconfiggigddG242-P3670140210280350SE +/- 1.01, N = 3309.48310.14308.30

Stress-NG

Test: Atomic

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AtomicgigddG242-P36246810SE +/- 0.59, N = 155.646.807.291. (CXX) g++ options: -O2 -std=gnu99 -lc

Timed LLVM Compilation

Build System: Ninja

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed LLVM Compilation 16.0Build System: NinjagigddG242-P3660120180240300SE +/- 0.67, N = 3267.86264.74266.33

Llama.cpp

Model: llama-2-70b-chat.Q5_0.gguf

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b1808Model: llama-2-70b-chat.Q5_0.ggufgigddG242-P360.70651.4132.11952.8263.5325SE +/- 0.03, N = 83.133.143.071. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -mcpu=native -lopenblas

Stockfish

Total Time

OpenBenchmarking.orgNodes Per Second, More Is BetterStockfish 15Total TimegigddG242-P3650M100M150M200M250MSE +/- 6857171.33, N = 151776539162268595481886531771. (CXX) g++ options: -lgcov -lpthread -fno-exceptions -std=c++17 -fno-peel-loops -fno-tracer -pedantic -O3 -flto -flto=jobserver

OpenSSL

Algorithm: ChaCha20-Poly1305

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20-Poly1305gigG242-P3620000M40000M60000M80000M100000MSE +/- 361309.16, N = 31122503964001122134488401. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: ChaCha20

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: ChaCha20gigG242-P3630000M60000M90000M120000M150000MSE +/- 10001054.79, N = 31617916630401617322260701. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: AES-256-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-256-GCMgigG242-P3670000M140000M210000M280000M350000MSE +/- 40660594.45, N = 33065445348703064878426801. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Speedb

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read While WritinggigddG242-P363M6M9M12M15MSE +/- 201662.23, N = 151325534113785530129050351. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Random ReadgigddG242-P36100M200M300M400M500MSE +/- 4162622.50, N = 154505009124042918134340523551. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Quicksilver

Input: CORAL2 P2

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P2gigddG242-P365M10M15M20M25MSE +/- 84129.53, N = 32552000024460000255433331. (CXX) g++ options: -fopenmp -O3 -march=native

OpenSSL

Algorithm: AES-128-GCM

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: AES-128-GCMgigddG242-P3680000M160000M240000M320000M400000MSE +/- 3586455.40, N = 33828563282603827930286803826882073001. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA256

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA256gigddG242-P3620000M40000M60000M80000M100000MSE +/- 64411674.99, N = 31000395937501013212374501013229617531. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: SHA512

OpenBenchmarking.orgbyte/s, More Is BetterOpenSSL 3.1Algorithm: SHA512gigddG242-P367000M14000M21000M28000M35000MSE +/- 8688088.34, N = 33445339903034448701700344787695901. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Speedb

Test: Random Read

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random ReadgigddG242-P3690M180M270M360M450MSE +/- 2947408.87, N = 114184483044204374714095716251. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Read While Writing

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read While WritinggigddG242-P362M4M6M8M10MSE +/- 68677.29, N = 98516060863656385588451. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Llama.cpp

Model: llama-2-13b.Q4_0.gguf

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b1808Model: llama-2-13b.Q4_0.ggufgigddG242-P3648121620SE +/- 0.16, N = 1514.0214.1113.901. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -mcpu=native -lopenblas

CacheBench

Test: Read

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: ReadgigddG242-P362K4K6K8K10KSE +/- 0.01, N = 311438.6711438.8611438.28MIN: 11438.33 / MAX: 11438.85MIN: 11438.05 / MAX: 11439.05MIN: 11437.32 / MAX: 11438.591. (CC) gcc options: -O3 -lrt

CacheBench

Test: Read / Modify / Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: Read / Modify / WritegigddG242-P3610K20K30K40K50KSE +/- 2.04, N = 345027.4745041.1545034.98MIN: 43694.36 / MAX: 45640.07MIN: 43693.38 / MAX: 45647.65MIN: 43692.22 / MAX: 45639.261. (CC) gcc options: -O3 -lrt

CacheBench

Test: Write

OpenBenchmarking.orgMB/s, More Is BetterCacheBenchTest: WritegigddG242-P368K16K24K32K40KSE +/- 1.22, N = 338251.5938252.6338239.97MIN: 35289.91 / MAX: 41383.99MIN: 35291.37 / MAX: 41384.3MIN: 35288.52 / MAX: 413821. (CC) gcc options: -O3 -lrt

RocksDB

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Read Random Write RandomgigddG242-P36800K1600K2400K3200K4000KSE +/- 30568.75, N = 73449038353732233203371. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Stress-NG

Test: Futex

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: FutexgigddG242-P3670K140K210K280K350KSE +/- 7072.24, N = 15323012.96318037.93343012.751. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Context Switching

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Context SwitchinggigddG242-P364M8M12M16M20MSE +/- 174052.70, N = 1519654874.8520708288.9820365273.281. (CXX) g++ options: -O2 -std=gnu99 -lc

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamgigddG242-P3630060090012001500SE +/- 0.85, N = 31326.961328.001320.14

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering - Scenario: Asynchronous Multi-StreamgigddG242-P361122334455SE +/- 0.03, N = 346.5546.4247.03

Algebraic Multi-Grid Benchmark

OpenBenchmarking.orgFigure Of Merit, More Is BetterAlgebraic Multi-Grid Benchmark 1.2gigG242-P36200M400M600M800M1000MSE +/- 47484.50, N = 3106013600010570643331. (CC) gcc options: -lparcsr_ls -lparcsr_mv -lseq_mv -lIJ_mv -lkrylov -lHYPRE_utilities -lm -fopenmp -lmpi

Timed Linux Kernel Compilation

Build: defconfig

OpenBenchmarking.orgSeconds, Fewer Is BetterTimed Linux Kernel Compilation 6.1Build: defconfiggigddG242-P3620406080100SE +/- 0.82, N = 380.0880.2478.70

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P36306090120150SE +/- 1.58, N = 3149.48145.28146.75

Neural Magic DeepSparse

Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: BERT-Large, NLP Question Answering, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P3690180270360450SE +/- 4.70, N = 3421.35433.86430.14

GROMACS

Implementation: MPI CPU - Input: water_GMX50_bare

OpenBenchmarking.orgNs Per Day, More Is BetterGROMACS 2023Implementation: MPI CPU - Input: water_GMX50_baregigG242-P361.05482.10963.16444.21925.274SE +/- 0.002, N = 34.6884.5881. (CXX) g++ options: -O3

Stress-NG

Test: MEMFD

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MEMFDgigddG242-P36120240360480600SE +/- 4.82, N = 8576.53569.36574.851. (CXX) g++ options: -O2 -std=gnu99 -lc

Speedb

Test: Random Fill

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random FillgigddG242-P3660K120K180K240K300KSE +/- 1985.22, N = 32782642853162849871. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Speedb

Test: Random Fill Sync

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Random Fill SyncgigddG242-P3640K80K120K160K200KSE +/- 1986.97, N = 32044102078912073761. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Speedb

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Update RandomgigddG242-P3660K120K180K240K300KSE +/- 1573.56, N = 32649982647482722751. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

Speedb

Test: Read Random Write Random

OpenBenchmarking.orgOp/s, More Is BetterSpeedb 2.7Test: Read Random Write RandomgigddG242-P36500K1000K1500K2000K2500KSE +/- 21596.32, N = 32518519247333624196831. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

RocksDB

Test: Update Random

OpenBenchmarking.orgOp/s, More Is BetterRocksDB 8.0Test: Update RandomgigddG242-P36100K200K300K400K500KSE +/- 4409.44, N = 34279084438044314061. (CXX) g++ options: -O3 -march=native -pthread -fno-builtin-memcmp -fno-rtti -lpthread

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgverify/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096gigddG242-P36110K220K330K440K550KSE +/- 27.21, N = 3518115.9518085.7517886.01. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

OpenSSL

Algorithm: RSA4096

OpenBenchmarking.orgsign/s, More Is BetterOpenSSL 3.1Algorithm: RSA4096gigddG242-P3614002800420056007000SE +/- 0.10, N = 36345.66345.36342.81. (CC) gcc options: -pthread -O3 -lssl -lcrypto -ldl

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P361326395265SE +/- 0.09, N = 355.4255.7255.57

Neural Magic DeepSparse

Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, BERT base uncased SST2, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P362004006008001000SE +/- 1.48, N = 31141.451135.441137.78

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamgigddG242-P36400800120016002000SE +/- 0.45, N = 31830.721843.241830.58

Neural Magic DeepSparse

Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Document Classification, oBERT base uncased on IMDB - Scenario: Asynchronous Multi-StreamgigddG242-P36816243240SE +/- 0.08, N = 333.8733.1633.75

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamgigddG242-P36400800120016002000SE +/- 1.29, N = 31832.121850.231834.58

Neural Magic DeepSparse

Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Token Classification, BERT base uncased conll2003 - Scenario: Asynchronous Multi-StreamgigddG242-P36816243240SE +/- 0.04, N = 333.5833.2433.62

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P3670140210280350SE +/- 0.86, N = 3310.64311.53310.84

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO, Sparse INT8 - Scenario: Asynchronous Multi-StreamgigddG242-P364080120160200SE +/- 0.53, N = 3202.63201.76202.23

Quicksilver

Input: CORAL2 P1

OpenBenchmarking.orgFigure Of Merit, More Is BetterQuicksilver 20230818Input: CORAL2 P1gigddG242-P366M12M18M24M30MSE +/- 81103.50, N = 32581000025510000252733331. (CXX) g++ options: -fopenmp -O3 -march=native

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamgigddG242-P3670140210280350SE +/- 0.64, N = 3315.90316.91314.15

Neural Magic DeepSparse

Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Detection, YOLOv5s COCO - Scenario: Asynchronous Multi-StreamgigddG242-P364080120160200SE +/- 0.52, N = 3198.91198.31200.03

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamgigddG242-P364080120160200SE +/- 0.10, N = 3185.87183.64185.36

Neural Magic DeepSparse

Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: NLP Text Classification, DistilBERT mnli - Scenario: Asynchronous Multi-StreamgigddG242-P3670140210280350SE +/- 0.19, N = 3339.52343.76339.98

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamgigddG242-P36306090120150SE +/- 0.18, N = 3132.32132.26132.30

Neural Magic DeepSparse

Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: ResNet-50, Baseline - Scenario: Asynchronous Multi-StreamgigddG242-P36100200300400500SE +/- 0.42, N = 3477.10477.69476.38

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgms/batch, Fewer Is BetterNeural Magic DeepSparse 1.6Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamgigddG242-P36306090120150SE +/- 0.15, N = 3133.45130.66132.10

Neural Magic DeepSparse

Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-Stream

OpenBenchmarking.orgitems/sec, More Is BetterNeural Magic DeepSparse 1.6Model: CV Classification, ResNet-50 ImageNet - Scenario: Asynchronous Multi-StreamgigddG242-P36100200300400500SE +/- 0.36, N = 3472.07483.31477.81

7-Zip Compression

Test: Decompression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Decompression RatinggigddG242-P36120K240K360K480K600KSE +/- 396.38, N = 35412045415525376471. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

7-Zip Compression

Test: Compression Rating

OpenBenchmarking.orgMIPS, More Is Better7-Zip Compression 22.01Test: Compression RatinggigddG242-P3670K140K210K280K350KSE +/- 991.66, N = 33330573315793333161. (CXX) g++ options: -lpthread -ldl -O2 -fPIC

Stress-NG

Test: IO_uring

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: IO_uringgigddG242-P36130K260K390K520K650KSE +/- 5192.48, N = 3612149.93583751.83604943.761. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: MMAP

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MMAPgigddG242-P362004006008001000SE +/- 5.43, N = 31104.191092.251088.771. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Cloning

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CloninggigddG242-P362K4K6K8K10KSE +/- 29.21, N = 37312.786918.497795.961. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Malloc

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MallocgigddG242-P3640M80M120M160M200MSE +/- 296218.44, N = 3164067515.18164592319.96164364343.391. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: CPU Cache

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU CachegigddG242-P36200K400K600K800K1000KSE +/- 1033.74, N = 3882510.28882225.34879814.351. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pthread

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: PthreadgigddG242-P3620K40K60K80K100KSE +/- 65.20, N = 3112993.15113379.28113551.871. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Zlib

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: ZlibgigddG242-P3613002600390052006500SE +/- 0.87, N = 35993.745985.695987.881. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Shuffle

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector ShufflegigddG242-P3620K40K60K80K100KSE +/- 3.20, N = 386375.7986257.7786218.951. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector MathgigddG242-P3690K180K270K360K450KSE +/- 4.53, N = 3398993.46399042.09398869.871. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Wide Vector Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Wide Vector MathgigddG242-P36500K1000K1500K2000K2500KSE +/- 6960.54, N = 32355564.942354926.972346519.631. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Matrix Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix MathgigddG242-P36150K300K450K600K750KSE +/- 404.39, N = 3682490.75682554.33681885.301. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Function Call

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Function CallgigddG242-P3615K30K45K60K75KSE +/- 1.53, N = 372298.2372290.8172283.181. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Matrix 3D Math

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Matrix 3D MathgigddG242-P3611002200330044005500SE +/- 3.74, N = 35082.655089.195099.811. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: CPU Stress

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CPU StressgigddG242-P367K14K21K28K35KSE +/- 1.60, N = 333765.2633559.8733761.081. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: AVL Tree

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVL TreegigddG242-P3670140210280350SE +/- 0.16, N = 3299.10299.99299.501. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Crypto

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: CryptogigddG242-P3650K100K150K200K250KSE +/- 928.63, N = 3251986.12251996.36252315.261. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Fused Multiply-Add

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Fused Multiply-AddgigddG242-P3630M60M90M120M150MSE +/- 110268.18, N = 3151387869.76151037296.46151220570.511. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Hash

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: HashgigddG242-P363M6M9M12M15MSE +/- 9429.94, N = 315654462.9215654282.5815671801.481. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: SENDFILE

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: SENDFILEgigddG242-P36300K600K900K1200K1500KSE +/- 18.53, N = 31624969.461624702.091624492.921. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: AVX-512 VNNI

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: AVX-512 VNNIgigddG242-P361000K2000K3000K4000K5000KSE +/- 401.84, N = 34691697.854692452.804690386.641. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Glibc Qsort Data Sorting

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Glibc Qsort Data SortinggigddG242-P36400800120016002000SE +/- 0.78, N = 32022.012020.302020.181. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Vector Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Vector Floating PointgigddG242-P3620K40K60K80K100KSE +/- 25.89, N = 3102553.11102604.74102535.351. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Floating Point

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Floating PointgigddG242-P365K10K15K20K25KSE +/- 0.42, N = 322219.8022220.7022213.541. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Poll

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: PollgigddG242-P361.6M3.2M4.8M6.4M8MSE +/- 12697.25, N = 37392099.827395099.647330369.961. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Glibc C String Functions

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Glibc C String FunctionsgigddG242-P3613M26M39M52M65MSE +/- 17918.08, N = 362867317.1662845443.5362783286.481. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: System V Message Passing

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: System V Message PassinggigddG242-P365M10M15M20M25MSE +/- 32907.24, N = 321054213.7921119614.3121143237.721. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Forking

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: ForkinggigddG242-P3611K22K33K44K55KSE +/- 410.62, N = 350130.9750686.5852250.531. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Memory Copying

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Memory CopyinggigddG242-P366K12K18K24K30KSE +/- 1.16, N = 327162.1427159.0727153.741. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Semaphores

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: SemaphoresgigddG242-P3640M80M120M160M200MSE +/- 217685.76, N = 3167850957.68166379337.67167637763.591. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Mutex

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: MutexgigddG242-P368M16M24M32M40MSE +/- 9463.26, N = 337215286.0437267646.9137172432.661. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Mixed Scheduler

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Mixed SchedulergigddG242-P368K16K24K32K40KSE +/- 141.59, N = 336309.2936361.2936794.331. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: NUMA

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: NUMAgigddG242-P3630060090012001500SE +/- 2.47, N = 31416.031426.451419.061. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Pipe

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: PipegigddG242-P367M14M21M28M35MSE +/- 95784.06, N = 329805509.1230776841.7330330081.181. (CXX) g++ options: -O2 -std=gnu99 -lc

Stress-NG

Test: Socket Activity

OpenBenchmarking.orgBogo Ops/s, More Is BetterStress-NG 0.16.04Test: Socket ActivitygigddG242-P366K12K18K24K30KSE +/- 159.43, N = 327959.8527536.7928009.071. (CXX) g++ options: -O2 -std=gnu99 -lc

Llama.cpp

Model: llama-2-7b.Q4_0.gguf

OpenBenchmarking.orgTokens Per Second, More Is BetterLlama.cpp b1808Model: llama-2-7b.Q4_0.ggufgigddG242-P36612182430SE +/- 0.21, N = 621.9026.6421.581. (CXX) g++ options: -std=c++11 -fPIC -O3 -pthread -mcpu=native -lopenblas

miniFE

Problem Size: Small

OpenBenchmarking.orgCG Mflops, More Is BetterminiFE 2.2Problem Size: SmallgigG242-P365K10K15K20K25KSE +/- 14.30, N = 424150.723996.01. (CXX) g++ options: -O3 -fopenmp -lmpi_cxx -lmpi

ACES DGEMM

Sustained Floating-Point Rate

OpenBenchmarking.orgGFLOP/s, More Is BetterACES DGEMM 1.0Sustained Floating-Point RategigG242-P3648121620SE +/- 0.09, N = 418.2717.781. (CC) gcc options: -O3 -march=native -fopenmp


Phoronix Test Suite v10.8.5