new amp

ARMv8 Neoverse-N1 testing with a GIGABYTE G242-P36-00 MP32-AR2-00 v01000100 (F31k SCP: 2.10.20220531 BIOS) and ASPEED on Ubuntu 23.10 via the Phoronix Test Suite.

HTML result view exported from: https://openbenchmarking.org/result/2402068-NE-NEWAMP18865&grs.

Llamafile

Test: mistral-7b-instruct-v0.2.Q8_0 - Acceleration: CPU

Llamafile

Test: llava-v1.5-7b-q4 - Acceleration: CPU

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

Llamafile

Test: wizardcoder-python-34b-v1.0.Q6_K - Acceleration: CPU

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard

ONNX Runtime

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Parallel

LZ4 Compression

Compression Level: 1 - Decompression Speed

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Parallel

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

LZ4 Compression

Compression Level: 9 - Compression Speed

LZ4 Compression

Compression Level: 1 - Compression Speed

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Parallel

LZ4 Compression

Compression Level: 3 - Decompression Speed

LZ4 Compression

Compression Level: 3 - Compression Speed

LZ4 Compression

Compression Level: 9 - Decompression Speed

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

ONNX Runtime

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Standard

ONNX Runtime

Model: Faster R-CNN R-50-FPN-int8 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Standard

ONNX Runtime

Model: super-resolution-10 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Standard

ONNX Runtime

Model: ResNet50 v1-12-int8 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Standard

ONNX Runtime

Model: ArcFace ResNet-100 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Standard

ONNX Runtime

Model: fcn-resnet101-11 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Standard

ONNX Runtime

Model: CaffeNet 12-int8 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Standard

ONNX Runtime

Model: bertsquad-12 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Standard

ONNX Runtime

Model: T5 Encoder - Device: CPU - Executor: Parallel

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Standard

ONNX Runtime

Model: yolov4 - Device: CPU - Executor: Parallel

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Standard

ONNX Runtime

Model: GPT-2 - Device: CPU - Executor: Parallel

Phoronix Test Suite v10.8.5