qwen-tts

qwen-tts

即用 qwen-tts v1.0.0

技能概述

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.

安装方法

**First-time setup** (one-time):

cd skills/public/qwen-tts bash scripts/setup.sh

This creates a local virtual environment and installs `qwen-tts` package (~500MB).

**Note:** First synthesis downloads ~1.7GB model from Hugging Face automatically.

使用示例

scripts/tts.py "Ciao, come va?" -l Italian -o output.wav
scripts/tts.py "Sono felice!" -i "Parla con entusiasmo" -l Italian -o happy.wav
scripts/tts.py "Hello world" -s Ryan -l English -o hello.wav
scripts/tts.py "Benvenuto nel futuro del text-to-speech" -l Italian -o welcome.wav
scripts/tts.py "Sono molto felice di vederti!" -i "Parla con entusiasmo e gioia" -l Italian -o happy.wav
scripts/tts.py "Hello, nice to meet you" -s Ryan -l English -o ryan.wav
scripts/tts.py --list-speakers