#

audio-augmentation

Here are 9 public repositories matching this topic...

KentoNishi / torch-pitch-shift

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation pitch-shift gpu-support torchaudio audio-augmentation

Updated Sep 25, 2024
Python

KentoNishi / torch-time-stretch

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

torch pytorch sound-processing augmentation gpu-support torchaudio time-stretch audio-augmentation

Updated Sep 5, 2022
Python

Lallapallooza / fast-audiomentations

⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.

audio python machine-learning gpu dsp pytorch triton data-augmentation audio-effects audio-augmentation augmentations audio-data-augmentation

Updated Jan 19, 2024
Python

zabir-nabil / torch-speech-dataloader

A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations

speech torch audio-augmentation torch-dataloader pytorch-speech-dataloader gpu-augmentation speech-augmentation-gpu

Updated Nov 6, 2022
Python

lgpearson1771 / openwakeword-trainer

Train custom wake word models with openWakeWord. A granular 13-step pipeline with compatibility patches for torchaudio 2.10+, Piper TTS, and speechbrain. Generates tiny ONNX models (~200 KB) for real-time keyword detection — like building your own "Hey Siri" trigger. WSL2/Linux + CUDA required.

python text-to-speech deep-learning speech-recognition speech-to-text keyword-spotting voice-assistant wake-word-detection onnx on-device training-pipeline edge-ai audio-augmentation wsl2 speechbrain openwakeword piper-tts wake-word-training custom-wake-word

Updated Feb 13, 2026
Python

DBraun / audiotree

Audio data loading and augmentations in JAX

audio dataloader jax audio-augmentation

Updated Apr 19, 2026
Python

LarsMonstad / amt-augmentor

Python augmentation toolkit for Automatic Music Transcription datasets

audio music amt augmentation automatic-music-transcription audio-augmentation music-augmentation music-transc

Updated Apr 24, 2026
Python

moego0 / custom_KWS

End-to-end pipeline for training a custom keyword detection model with TensorFlow & TFLite expor

deep-learning tensorflow keras speech-recognition mfcc keyword-spotting cnn-model voice-detection audio-processing tflite audio-processing-with-python edge-ai audio-augmentation esc50

Updated Feb 24, 2026
Python

AndreasScharnetzki / EmotionClassifier

A Convolutional Neural Network that distinguishes between the speakers emotions. Comes with multiple preprocessors to improve the models performance.

natural-language-processing supervised-learning convolutional-neural-networks transfer-learning preprocessing human-computer-interaction audio-processing multi-class-classification audio-augmentation variable-length-data speech-emotion-classification

Updated Jan 20, 2022
Python

Improve this page

Add a description, image, and links to the audio-augmentation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-augmentation topic, visit your repo's landing page and select "manage topics."