StyleTTS2 (Kokoro-82M Fine-tuning Fork)

Note: This fork's main branch is a patched version of the original yl4579/StyleTTS2 repository. It is maintained specifically as a git submodule for the kokoro-deutsch training recipe.

Why this fork exists

The Kokoro-82M TTS model is based on the StyleTTS 2 architecture. However, to fine-tune it using its published HuggingFace weights, several modifications to the upstream training code are required:

PyTorch API Migration (Critical): Migrated torch.nn.utils.weight_norm and spectral_norm to the modern torch.nn.utils.parametrizations API. This is mandatory for compatibility with Kokoro's inference pipeline (KModel).
Kokoro Symbols: Integrated Kokoro's specific 178-token IPA vocabulary (kokoro_symbols.py).
Bug Fixes:
- Fixed an unsqueeze shape mismatch crash at epoch boundaries involving F0 tensors.
- Fixed checkpoint saving order to prevent data loss if TensorBoard audio generation fails.
- Fixed missing .train() mode re-initializations after checkpoint loading in Stage 2.
- Removed hardcoded ipdb breakpoints that caused silent hangs.
- Added a monkey-patch for torch.load weights_only=False for PyTorch 2.6+ compatibility.
- Filtered long phoneme sequences (> 510 tokens) to prevent PLBERT position embedding overflows.

Usage

This repository is not meant to be used standalone.

Please see the kokoro-deutsch repository for the full end-to-end training guide, dataset preparation scripts, and voicepack extraction tools.

For the original StyleTTS2 project and documentation, please visit the upstream repository.

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
Colab		Colab
Configs		Configs
Data		Data
Demo		Demo
Modules		Modules
Utils		Utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
kokoro_symbols.py		kokoro_symbols.py
kokoro_tb_utils.py		kokoro_tb_utils.py
losses.py		losses.py
meldataset.py		meldataset.py
models.py		models.py
optimizers.py		optimizers.py
requirements.txt		requirements.txt
text_utils.py		text_utils.py
train_finetune.py		train_finetune.py
train_finetune_accelerate.py		train_finetune_accelerate.py
train_first.py		train_first.py
train_second.py		train_second.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StyleTTS2 (Kokoro-82M Fine-tuning Fork)

Why this fork exists

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

StyleTTS2 (Kokoro-82M Fine-tuning Fork)

Why this fork exists

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages