RKLLM Installation
Introduction to RKLLM
RKLLM helps users deploy LLM models to Rockchip chips quickly. Currently, it supports the rk3588/rk3576 chips. The overall framework is shown below:
Currently Supported Models
- TinyLLAMA 1.1B
- Qwen 1.8B
- Qwen2 0.5B
- Phi-2 2.7B
- Phi-3 3.8B
- ChatGLM3 6B
- Gemma 2B
- InternLM2 1.8B
- MiniCPM 2B
Install RKLLM
To use RKNPU, users first need to run the RKLLM-Toolkit tool on an x86 workstation to convert the trained model to the RKLLM format, then use the RKLLM C API on the development board for inference.
x86 PC Workstation
-
(Optional) Install Anaconda
If Python 3.8 (required version) is not installed on your system or if you have multiple Python versions, it is recommended to use Anaconda to create a new Python 3.8 environment.
-
Install Anaconda
To check if Anaconda is installed, run the following command in the terminal. If it is installed, you can skip this step.
$ conda --version
conda 23.10.0If you see
conda: command not found
, Anaconda is not installed. Please refer to the Anaconda website for installation instructions. -
Create a conda environment
conda create -n rkllm python=3.8
-
Activate the rkllm conda environment
conda activate rkllm
-
Deactivate the environment
conda deactivate
-
-
RKLLM-Toolkit is a software development package for converting and quantizing Huggingface format LLM models on a PC.
git clone https://github.com/airockchip/rknn-llm.git
pip3 install ./rknn-llm/rkllm-toolkit/packages/rkllm_toolkit-1.0.1-cp38-cp38-linux_x86_64.whlIf the following command runs without errors, the installation is successful:
python3
from rkllm.api import RKLLM
Development Board
-
Check if the NPU driver version is 0.9.6 or higher. If it is lower, download and flash the latest Radxa 6.1 firmware.
$ sudo cat /sys/kernel/debug/rknpu/version
RKNPU driver: v0.9.6(Optional) Manually compile the NPU kernel
If you are using a non-official firmware, you need to update the kernel. The RKNPU driver package supports two major kernel versions: kernel-5.10 and kernel-6.1. You can confirm the exact version number in the Makefile in the root directory of the kernel. Follow these steps to update the kernel:
-
Download the rknpu_driver_0.9.6_20240322.tar.bz2 archive.
-
Extract the archive and overwrite the current kernel code directory with the rknpu driver code.
-
Recompile the kernel.
-
Flash the newly compiled kernel to the device.
-
-
RKLLM Runtime provides C/C++ programming interfaces for the Rockchip NPU platform to help users deploy RKLLM models and accelerate LLM applications.
git clone https://github.com/airockchip/rknn-llm.git