Commit 80b553c
authored
Document G2P phoneme mismatch limitation in Kokoro (#414)
## Summary
- Documents the grapheme-to-phoneme (G2P) conversion limitation
affecting Kokoro and KittenTTS pronunciation quality
- Adds Known Issues section explaining the espeak vs
graphemes_to_phonemes_en_us mismatch
- References PR #409 where pronunciation issues were discovered during
KittenTTS testing
## Context
During KittenTTS integration testing (PR #409), @Josscii discovered
pronunciation issues with words like "hello" and "day". @Alex-Wengg
identified the root cause: the original Kokoro and KittenTTS models were
trained using espeak for phoneme generation, but FluidAudio uses
`graphemes_to_phonemes_en_us` from HuggingFace (PeterReid).
This mismatch causes some words to be pronounced incorrectly because the
phoneme outputs don't match what the models expect.
## The Limitation
- **Current G2P**: `graphemes_to_phonemes_en_us` (HuggingFace:
PeterReid/graphemes_to_phonemes_en_us)
- **Models trained with**: espeak phonemes
- **Why we can't use espeak**: Licensing constraints
- **Impact**: Affects all TTS models using the shared Kokoro G2P
pipeline
- **What's needed**: An espeak-compatible alternative with a permissive
license
## Test plan
- [x] Documentation builds correctly
- [x] Links to PR #409 comment thread work
- [x] Known Issues section is clear and actionable
📝 Generated with [Claude Code](https://claude.com/claude-code)
<!-- devin-review-badge-begin -->
---
<a href="https://app.devin.ai/review/fluidinference/fluidaudio/pull/414"
target="_blank">
<picture>
<source media="(prefers-color-scheme: dark)"
srcset="https://static.devin.ai/assets/gh-open-in-devin-review-dark.svg?v=1">
<img
src="https://static.devin.ai/assets/gh-open-in-devin-review-light.svg?v=1"
alt="Open with Devin">
</picture>
</a>
<!-- devin-review-badge-end -->1 parent b80d364 commit 80b553c
1 file changed
+2
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
112 | 112 | | |
113 | 113 | | |
114 | 114 | | |
| 115 | + | |
| 116 | + | |
115 | 117 | | |
116 | 118 | | |
117 | 119 | | |
| |||
0 commit comments