
Performance Benchmark

Benchmarks provide a direct way to evaluate the performance of the hardware platform.

Note: Benchmark results vary with the application scenario and the degree of model optimization, and are for reference only.

Test Description

  • Test tool: axcl_run_model
  • Batch Size: 1 or 8
  • Unit: FPS (frames per second)

Note: Because memory copy (memcpy) and PCIe performance differ across host platforms, axcl_run_model measures only the inference time of the neural network model on the device side and excludes host-side operations.

Vision Model

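| Vision Model | Input Size | Batch 1 (IPS) | Batch 8 (IPS) |
|--------------|------------|---------------|---------------|
| Inceptionv1  | 224        | 1073          | 2494          |
| Inceptionv3  | 224        | 478           | 702           |
| MobileNetv1  | 224        | 1508          | 4854          |
| MobileNetv2  | 224        | 1366          | 5073          |
| ResNet18     | 224        | 1066          | 2254          |
| ResNet50     | 224        | 576           | 1045          |
| SqueezeNet11 | 224        | 1560          | 5961          |
| Swin-T       | 224        | 342           | 507           |
| ViT-B/16     | 224        | 162           | 207           |
| YOLOv5s      | 640        | 326           | 394           |
| YOLOv6s      | 640        | 282           | 322           |
| YOLOv8s      | 640        | 248           | 279           |
| YOLOv9s      | 640        | 237           |               |
| YOLOv10s     | 640        | 298           |               |
| YOLOv11n     | 640        | 860           |               |
| YOLOv11s     | 640        | 305           |               |
| YOLOv11m     | 640        | 114           |               |
| YOLOv11l     | 640        | 87            |               |
| YOLOv11x     | 640        | 41            |               |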
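
To read the throughput figures as per-inference time, or to see how much batching helps, the numbers can be converted with a few lines of Python. This is a minimal sketch, not part of axcl_run_model: the helper names `latency_ms` and `batch_speedup` are made up for illustration, and it assumes that IPS counts individual inferences per second at both batch sizes and that inferences run back to back on the device.

```python
# Throughput figures copied from the vision table above: (batch 1 IPS, batch 8 IPS).
RESULTS = {
    "MobileNetv2": (1366, 5073),
    "ResNet50": (576, 1045),
    "YOLOv5s": (326, 394),
}


def latency_ms(ips: float) -> float:
    """Average device-side time per inference in milliseconds (assumes back-to-back runs)."""
    return 1000.0 / ips


def batch_speedup(batch1_ips: float, batch8_ips: float) -> float:
    """Throughput gain of batch 8 relative to batch 1."""
    return batch8_ips / batch1_ips


if __name__ == "__main__":
    for name, (b1, b8) in RESULTS.items():
        print(
            f"{name}: {latency_ms(b1):.2f} ms per inference at batch 1, "
            f"batch 8 throughput is {batch_speedup(b1, b8):.2f}x batch 1"
        )
```

Under these assumptions, the small classification networks gain far more from batching than the 640-input detectors, which is consistent with the ratios in the table.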

Audio Model

| Audio Model   | RTF  |
|---------------|------|
| Whisper-Tiny  | 0.03 |
| Whisper-Small | 0.18 |
| MeloTTS       | 0.04 |
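
RTF (real-time factor) is conventionally defined as processing time divided by audio duration, so values below 1 mean faster than real time. The sketch below applies that definition to the figures above; the function names are illustrative only, and it assumes the table uses the conventional definition.

```python
def processing_time_s(audio_seconds: float, rtf: float) -> float:
    """Time needed to process (or synthesize) `audio_seconds` of audio at a given RTF."""
    return audio_seconds * rtf


def realtime_multiple(rtf: float) -> float:
    """How many seconds of audio are handled per second of compute."""
    return 1.0 / rtf


if __name__ == "__main__":
    # RTF values copied from the audio table above.
    for model, rtf in {"Whisper-Tiny": 0.03, "Whisper-Small": 0.18, "MeloTTS": 0.04}.items():
        print(
            f"{model}: 30 s of audio takes about {processing_time_s(30, rtf):.2f} s "
            f"({realtime_multiple(rtf):.1f}x real time)"
        )
```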

LLM

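| LLM          | Prompt length (tokens) | TTFT (ms) | Generate (tokens/s) |
|--------------|------------------------|-----------|---------------------|
| Qwen2.5-0.5B | 128                    | 188       | 28                  |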
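
TTFT (time to first token) and the generation rate can be combined into a rough estimate of how long a full response takes. The following is a back-of-the-envelope sketch with hypothetical helper names; it assumes the first token arrives after TTFT, every later token arrives at the steady generation rate, and TTFT is dominated by prompt processing.

```python
def response_time_s(ttft_ms: float, tokens_per_s: float, new_tokens: int) -> float:
    """Estimated wall time to produce `new_tokens` tokens.

    Assumes the first token arrives after TTFT and the remaining
    tokens are generated at a constant `tokens_per_s`.
    """
    if new_tokens < 1:
        return 0.0
    return ttft_ms / 1000.0 + (new_tokens - 1) / tokens_per_s


def prefill_tokens_per_s(prompt_tokens: int, ttft_ms: float) -> float:
    """Approximate prompt-processing rate, treating TTFT as pure prefill time."""
    return prompt_tokens / (ttft_ms / 1000.0)


if __name__ == "__main__":
    # Figures copied from the Qwen2.5-0.5B row above.
    prompt_tokens, ttft_ms, gen_rate = 128, 188, 28
    print(f"Prefill rate: about {prefill_tokens_per_s(prompt_tokens, ttft_ms):.0f} tokens/s")
    print(f"100-token reply: about {response_time_s(ttft_ms, gen_rate, 100):.2f} s")
```

Real response times also depend on host-side work, which the benchmark excludes, so treat the result as a lower bound.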

VLM

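| VLM          | Input Image | Image Encoder (ms) | Prompt length (tokens) | TTFT (ms) | Generate (tokens/s) |
|--------------|-------------|--------------------|------------------------|-----------|---------------------|
| InternVL2-1B | 448*448     | 4200               | 320                    | 425       | 29                  |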