Skip to content

Commit 762a3de

Browse files
committed
Make Mini the default KittenTTS variant
Add 'kitten' backend option that defaults to Mini (82M params) instead of requiring explicit 'kitten-mini' flag. Users can still use 'kitten-nano' for the smaller 15M model. Rationale: - Mini has better quality (3.13x RTF vs 1.12x for Nano) - Mini supports speed control, Nano does not - 82M is still relatively small and runs well on Apple Silicon Changes: - Add 'kitten' and 'kittentts' backend options → .kittenTts(.mini) - Update help text to show 'kitten (Mini 82M)' option - KittenTtsManager already defaults to .mini in its initializer
1 parent 784b581 commit 762a3de

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

Sources/FluidAudioCLI/Commands/TTSCommand.swift

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -201,6 +201,8 @@ public struct TTS {
201201
backend = .kokoro
202202
case "pocket", "pockettts":
203203
backend = .pocketTts
204+
case "kitten", "kittentts":
205+
backend = .kittenTts(.mini) // Default to Mini (82M)
204206
case "kitten-nano", "kittennano":
205207
backend = .kittenTts(.nano)
206208
case "kitten-mini", "kittenmini":
@@ -720,7 +722,7 @@ public struct TTS {
720722
Options:
721723
--output, -o Output WAV path (default: output.wav)
722724
--voice, -v Voice name (default: af_heart for Kokoro, alba for PocketTTS, expr-voice-3-f for KittenTTS)
723-
--backend TTS backend: kokoro (default), pocket, kitten-nano, kitten-mini
725+
--backend TTS backend: kokoro (default), pocket, kitten (Mini 82M), kitten-nano, kitten-mini
724726
--speed Speech speed multiplier (KittenTTS Mini only, default: 1.0)
725727
--lexicon, -l Custom pronunciation lexicon file (word=phonemes format, Kokoro only)
726728
--benchmark Run a predefined benchmarking suite with multiple sentences

0 commit comments

Comments
 (0)