VIT Example
This document describes using the QAI AppBuilder Python API to perform inference with the ViT target recognition model using Qualcomm® Hexagon™ Processor (NPU).
Supported Devices
| Device | SoC |
|---|---|
| Dragon Q6A | QCS6490 |
| Dragon Q8B | SC8280XP |
Install QAI AppBuilder
-
Please install QAI AppBuilder according to QAI AppBuilder Installation Guide.
-
Please configure ADSP environment variables according to Create ADSP Environment Variables.
Run Example
Install Dependencies
pip3 install requests tqdm qai-hub py3_wget opencv-python torch torchvision
Run Script
-
Navigate to the example directory
- QCS6490
- SC8280XP
Devicecd ai-engine-direct-helper/samples/linux/pythonDevicecd ai-engine-direct-helper/samples/linux/python -
Prepare input image, using the following image as input example

input image
-
Execute inference
Devicepython3 vit/vit.py$ python3 vit/vit.py
Current file directory: /mnt/ssd/qualcomm/701/zzf_fork/ai-engine-direct-helper/samples/linux/python/vit
0.0ms [WARNING] <W> Initializing HtpProvider
/prj/qct/webtech_scratch20/mlg_user_admin/qaisw_source_repo/rel/qairt-2.37.1/point_release/SNPE_SRC/avante-tools/prebuilt/dsp/hexagon-sdk-5.4.0/ipc/fastrpc/rpcmem/src/rpcmem_android.c:38:dummy call to rpcmem_init, rpcmem APIs will be used from libxdsprpc
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
148.6ms [WARNING] Time: Read model file to memory. 32.50
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
351.2ms [WARNING] Time: contextCreateFromBinary. 202.52
351.2ms [WARNING] Time: UnmapViewOfFile. 0.00
351.9ms [WARNING] Time: model_initialize vit 351.87
560.6ms [WARNING] Time: model_inference vit 159.35
Top 5 predictions for image:
Samoyed 0.7157644629
Pomeranian 0.0566752702
keeshond 0.0083217919
Eskimo dog 0.0057556415
Japanese spaniel 0.003955483
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
0.0ms [WARNING] <W> This META does not have Alloc2 Support
/prj/qct/webtech_scratch20/mlg_user_admin/qaisw_source_repo/rel/qairt-2.37.1/point_release/SNPE_SRC/avante-tools/prebuilt/dsp/hexagon-sdk-5.4.0/ipc/fastrpc/rpcmem/src/rpcmem_android.c:42:dummy call to rpcmem_deinit, rpcmem APIs will be used from libxdsprpc
585.7ms [WARNING] Time: model_destroy vit 20.11The printed results show that
Samoyedhas the highest confidence, which matches the input image content.