Qwen3-TTS Skill

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice model.

Installation

cd /home/brewuser/.nvm/versions/node/v24.13.0/lib/node_modules/clawdbot/skills/public/qwen-tts
bash scripts/setup.sh

This will:

scripts/tts.py "Ciao, questo è un test!" -l Italian -o test.wav

Play the audio:

aplay test.wav  # Linux
# or
ffplay test.wav  # Cross-platform

See SKILL.md for complete documentation.

Basic:

scripts/tts.py "Your text" -l Italian -o output.wav

List speakers:

scripts/tts.py --list-speakers

With emotion:

scripts/tts.py "Sono felice!" -i "Parla con entusiasmo" -l Italian

The skill is automatically available to OpenClaw once installed. OpenClaw can call:

cd skills/public/qwen-tts && scripts/tts.py "Text" -l Italian -o /tmp/audio.wav

Output path is printed to stdout (last line).

Uses Qwen3-TTS under Apache 2.0 license. Check model card for details: https://huggingface.co/Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice