You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: Eng_docs.md
+28-2Lines changed: 28 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,5 @@
1
1
# SoftVC VITS Singing Voice Conversion
2
+
2
3
## Updates
3
4
> According to incomplete statistics, it seems that training with multiple speakers may lead to **worsened leaking of voice timbre**. It is not recommended to train models with more than 5 speakers. The current suggestion is to try to train models with only a single speaker if you want to achieve a voice timbre that is more similar to the target.
4
5
> Fixed the issue with unwanted staccato, improving audio quality by a decent amount.\
@@ -8,11 +9,11 @@
8
9
9
10
## Model Overview
10
11
A singing voice coversion (SVC) model, using the SoftVC encoder to extract features from the input audio, sent into VITS along with the F0 to replace the original input to acheive a voice conversion effect. Additionally, changing the vocoder to [NSF HiFiGAN](https://github.com/openvpi/DiffSinger/tree/refactor/modules/nsf_hifigan) to fix the issue with unwanted staccato.
12
+
11
13
## Notice
12
14
+ The current branch is the 32kHz version, which requires less vram during inferencing, as well as faster inferencing speeds, and datasets for said branch take up less disk space. Thus the 32 kHz branch is recommended for use.
13
15
+ If you want to train 48 kHz variant models, switch to the [main branch](https://github.com/innnky/so-vits-svc/tree/main).
14
-
## Colab notebook script for dataset creation and training.
15
-
[colab training notebook](https://colab.research.google.com/drive/1rCUOOVG7-XQlVZuWRAj5IpGrMM8t07pE?usp=sharing)
## Colab notebook script for dataset creation and training.
37
+
[colab training notebook](https://colab.research.google.com/drive/1rCUOOVG7-XQlVZuWRAj5IpGrMM8t07pE?usp=sharing)
35
38
36
39
## Dataset preparation
37
40
All that is required is that the data be put under the `dataset_raw` folder in the structure format provided below.
@@ -81,3 +84,26 @@ Use [inference_main.py](inference_main.py)
81
84
+ Change `clean_names` to the output file name.
82
85
+ Use `trans` to edit the pitch shifting amount (semitones).
83
86
+ Change `spk_list` to the speaker name.
87
+
88
+
## Onnx Exporting.
89
+
### **When exporting Onnx, please make sure you re-clone the whole repository!!!**
90
+
Use [onnx_export.py](onnx_export.py)
91
+
+ Create a new folder called `checkpoints`.
92
+
+ Create a project folder in `checkpoints` folder with the desired name for your project, let's use `myproject` as example. Folder structure looks like `./checkpoints/myproject`.
93
+
+ Rename your model to `model.pth`, rename your config file to `config.json` then move them into `myproject` folder.
94
+
+ Modify [onnx_export.py](onnx_export.py) where `path = "NyaruTaffy"`, change `NyaruTaffy` to your project name, here it will be `path = "myproject"`.
95
+
+ Run [onnx_export.py](onnx_export.py)
96
+
+ Once it finished, a `model.onnx` will be generated in `myproject` folder, that's the model you just exported.
97
+
+ Notice: if you want to export a 48K model, please follow the instruction below or use `model_onnx_48k.py` directly.
98
+
+ Open [model_onnx.py](model_onnx.py) and change `hps={"sampling_rate": 32000...}` to `hps={"sampling_rate": 48000}` in class `SynthesizerTrn`.
99
+
+ Open [nvSTFT](/vdecoder/hifigan/nvSTFT.py) and replace all `32000` with `48000`
+ All training function and transformation are removed, only if they are all removed you are actually using Onnx.
103
+
104
+
## Gradio (WebUI)
105
+
Use [sovits_gradio.py](sovits_gradio.py) to run Gradio WebUI
106
+
+ Create a new folder called `checkpoints`.
107
+
+ Create a project folder in `checkpoints` folder with the desired name for your project, let's use `myproject` as example. Folder structure looks like `./checkpoints/myproject`.
108
+
+ Rename your model to `model.pth`, rename your config file to `config.json` then move them into `myproject` folder.
0 commit comments