Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
-
Updated
Sep 25, 2024 - Python
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations
Train custom wake word models with openWakeWord. A granular 13-step pipeline with compatibility patches for torchaudio 2.10+, Piper TTS, and speechbrain. Generates tiny ONNX models (~200 KB) for real-time keyword detection — like building your own "Hey Siri" trigger. WSL2/Linux + CUDA required.
Audio data loading and augmentations in JAX
Python augmentation toolkit for Automatic Music Transcription datasets
End-to-end pipeline for training a custom keyword detection model with TensorFlow & TFLite expor
A Convolutional Neural Network that distinguishes between the speakers emotions. Comes with multiple preprocessors to improve the models performance.
Add a description, image, and links to the audio-augmentation topic page so that developers can more easily learn about it.
To associate your repository with the audio-augmentation topic, visit your repo's landing page and select "manage topics."