Trying to learn machine learning for audio effects. This is my own take at coding audio effects using machine learning. In OCaml.
It started with the LSTM-based approach described in
In order to learn an effect you need both an original sound file (say
source.wav) and a processed sound file (say target.wav). You can start
learning with
leffect -s source.wav -t target.wavThis will learn the effect. While doing so it will also produce an output.wav
file (name can be changed with -o) resulting of the processing of the input
file by the current network. Some other useful parameters are
--rate: the learning rate--size: the size of the network--play: play the output in realtime
At the end, it produces an effect.json file (the name can be changed with the
--json flag).
In order to process a file with an already learned effect (whose parameters are
stored in effect.json), you can simply do
leffect -i file.wav -o output.wav