MeloTTS

This document explains how to run the MeloTTS example application on a host device equipped with the Radxa AICore AX-M1.

Download Example Application Repository

Use huggingfcae-cli to download the example application repository.

Host

pip3 install -U "huggingface_hub[cli]"
huggingface-cli download AXERA-TECH/MeloTTS --local-dir ./MeloTTS
cd MeloTTS

Example Usage

Install Python Dependencies

Host

cd python
pip3 install -r requirements.txt
pip3 install https://github.com/AXERA-TECH/pyaxengine/releases/download/0.1.3.rc1/axengine-0.1.3-py3-none-any.whl

Install System Dependencies

Host

sudo apt-get install libsndfile1-dev libmecab-dev

Copy nltk_data File

Host

cp -r ../nltk_data ~/

Model Inference

Host

python3 melotts.py -s "Radxa Computer is a Chinese tech company specializing in developing and manufacturing single-board computers, headquartered in Shenzhen." -e ../encoder-onnx/encoder-en.onnx -d ../decoder-ax650/decoder-en.axmodel

tip

Please modify the melotts.py file's 164th line of g-{language.lower()}.bin path

(.venv) rock@rock-5b-plus:~/ssd/axera/MeloTTS/python$ python3 melotts.py -s "Radxa Computer is a Chinese tech company specializing in developing and manufacturing single-board computers, headquartered in Shenzhen." -e ../encoder-onnx/encoder-zh.onnx -d ../decoder-ax650/decoder-zh.axmodel
[INFO] Available providers:  ['AXCLRTExecutionProvider']
sentence: Radxa Computer is a Chinese tech company specializing in developing and manufacturing single-board computers, headquartered in Shenzhen.
sample_rate: 44100
encoder: ../encoder-onnx/encoder-zh.onnx
decoder: ../decoder-ax650/decoder-zh.axmodel
language: ZH_MIX_EN
 > Text split to sentences.
Radxa Computer is a Chinese tech company specializing in developing and manufacturing single-board computers, headquartered in Shenzhen.
 > ===========================
split_sentences_into_pieces take 0.5743503570556641ms
[INFO] Using provider: AXCLRTExecutionProvider
[INFO] SOC Name: AX650N
[INFO] VNPU type: VNPUType.DISABLED
[INFO] Compiler version: 3.3 3251425d
load models take 2446.855306625366ms

Sentence[0]: Radxa Computer is a Chinese tech company specializing in developing and manufacturing single-board computers, headquartered in Shenzhen.
None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and only tokenizers, configuration and file/data utilities can be used.
Load language module take 7701.969146728516ms
Building prefix dict from the default dictionary ...
Loading model from cache /tmp/jieba.cache
Loading model cost 0.809 seconds.
Prefix dict has been built successfully.
encoder run take 113.38ms
Decode slice[0]: decoder run take 94.97ms
Decode slice[1]: decoder run take 92.53ms
Decode slice[2]: decoder run take 92.46ms
Decode slice[3]: decoder run take 92.45ms
Decode slice[4]: decoder run take 92.44ms
Decode slice[5]: decoder run take 92.38ms

Sentence[1]: headquartered in Shenzhen.
Load language module take 0.013828277587890625ms
encoder run take 52.27ms
Decode slice[0]: decoder run take 93.02ms
Decode slice[1]: decoder run take 92.62ms
Save to output.wav

Download Example Application Repository​

Example Usage​

Install Python Dependencies​

Install System Dependencies​

Copy nltk_data File​

Model Inference​

Download Example Application Repository

Example Usage

Install Python Dependencies

Install System Dependencies

Copy nltk_data File

Model Inference