summaryrefslogtreecommitdiff
path: root/makima/src/tts/mod.rs
diff options
context:
space:
mode:
authorsoryu <soryu@soryu.co>2026-01-28 03:47:31 +0000
committersoryu <soryu@soryu.co>2026-01-28 03:47:31 +0000
commitc3de071511de5e8a8d63ea4ca47c815cb6450215 (patch)
treed843045e3b9dd1228e9ce5771abe4efbc574fbf3 /makima/src/tts/mod.rs
parent20be7ba19964a6eba60079c17399cf8ff6e60cea (diff)
downloadsoryu-c3de071511de5e8a8d63ea4ca47c815cb6450215.tar.gz
soryu-c3de071511de5e8a8d63ea4ca47c815cb6450215.zip
feat: add voice loading and selection for TTS cloning
Add voice reference audio loading so the TTS speak handler can perform voice cloning using reference WAV files from the voices/ directory. - Add voice.rs module: loads manifest.json and reference.wav for a given voice_id, decodes via symphonia, resamples to 24kHz for the TTS engine - Update speak.rs: resolve voice_id from the speak request (default "makima"), load reference audio, pass it to engine.generate() - Add voices/makima/README.md with instructions for obtaining reference audio (extraction from YouTube, recording, ffmpeg conversion) - Graceful fallback: if reference audio is missing, TTS proceeds without voice cloning using the model's default voice Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Diffstat (limited to 'makima/src/tts/mod.rs')
0 files changed, 0 insertions, 0 deletions