Skip to main content

VIT Example

This document describes using the QAI AppBuilder Python API to perform inference with the ViT target recognition model using Qualcomm® Hexagon™ Processor (NPU).

Supported Devices

DeviceSoC
Dragon Q6AQCS6490
Dragon Q8BSC8280XP

Install QAI AppBuilder

tip
  1. Please install QAI AppBuilder according to QAI AppBuilder Installation Guide.

  2. Please configure ADSP environment variables according to Create ADSP Environment Variables.

Run Example

Install Dependencies

Device
pip3 install requests tqdm qai-hub py3_wget opencv-python torch torchvision

Run Script

  • Navigate to the example directory

    Device
    cd ai-engine-direct-helper/samples/linux/python
  • Prepare input image, using the following image as input example

    input image

  • Execute inference

    Device
    python3 vit/vit.py
    $ python3 vit/vit.py
    Current file directory: /mnt/ssd/qualcomm/701/zzf_fork/ai-engine-direct-helper/samples/linux/python/vit
    0.0ms [WARNING] <W> Initializing HtpProvider

    /prj/qct/webtech_scratch20/mlg_user_admin/qaisw_source_repo/rel/qairt-2.37.1/point_release/SNPE_SRC/avante-tools/prebuilt/dsp/hexagon-sdk-5.4.0/ipc/fastrpc/rpcmem/src/rpcmem_android.c:38:dummy call to rpcmem_init, rpcmem APIs will be used from libxdsprpc
    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    148.6ms [WARNING] Time: Read model file to memory. 32.50

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    351.2ms [WARNING] Time: contextCreateFromBinary. 202.52

    351.2ms [WARNING] Time: UnmapViewOfFile. 0.00

    351.9ms [WARNING] Time: model_initialize vit 351.87

    560.6ms [WARNING] Time: model_inference vit 159.35

    Top 5 predictions for image:

    Samoyed 0.7157644629
    Pomeranian 0.0566752702
    keeshond 0.0083217919
    Eskimo dog 0.0057556415
    Japanese spaniel 0.003955483
    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    0.0ms [WARNING] <W> This META does not have Alloc2 Support

    /prj/qct/webtech_scratch20/mlg_user_admin/qaisw_source_repo/rel/qairt-2.37.1/point_release/SNPE_SRC/avante-tools/prebuilt/dsp/hexagon-sdk-5.4.0/ipc/fastrpc/rpcmem/src/rpcmem_android.c:42:dummy call to rpcmem_deinit, rpcmem APIs will be used from libxdsprpc
    585.7ms [WARNING] Time: model_destroy vit 20.11

    The printed results show that Samoyed has the highest confidence, which matches the input image content.

    You need to be logged into GitHub to post a comment. If you are already logged in, please ignore this message.

    Radxa-docs © 2026 by Radxa Computer (Shenzhen) Co.,Ltd. is licensed under CC BY 4.0