Extensions to YAML syntax for better python interaction
-
Updated
Jan 1, 2026 - Python
Extensions to YAML syntax for better python interaction
Real-time speaker diarization using straightforward, intuitive logic - High accuracy thanks to SpeechBrain/Pyannote-WeSpeaker models
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
Target speaker automatic speech recognition (TS-ASR)
Incremental learning for automatic speech recognition (ASR)
A Streamlit web app for speaker diarization and identification in audio files. Upload or record audio, transcribe conversations, and automatically segment and label speakers using reference samples. This app makes it easy to analyze multi-speaker audio, export transcripts, and identify "who spoke when" for meetings, interviews, and more.
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
Train custom wake word models with openWakeWord. A granular 13-step pipeline with compatibility patches for torchaudio 2.10+, Piper TTS, and speechbrain. Generates tiny ONNX models (~200 KB) for real-time keyword detection — like building your own "Hey Siri" trigger. WSL2/Linux + CUDA required.
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
Private, local-only desktop app for audio transcription with speaker diarization and AI-powered summarization. Windows. Python + React.
Speech transcription and speech diarization
Privacy-safe multi-modal authentication (Face + Voice) with optional offline STT.
HACKPUE 2025 - HORIZON'S H.E.R.A
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
A short test to determine the distribution of similarity scores for different SpeechBrain speaker identification models.
Templates for Automatic Speech Recognition, Diarization and Emotion Recognition.
Pipeline d’anonymisation vocale prêt pour VPC 2025, basé sur des x-vectors ECAPA (SpeechBrain) et une synthèse vocale simple pour préserver l’intelligibilité. Scripts de bout en bout pour construire le pool d’orateurs, anonymiser et évaluer localement (cosine, WER proxy).
Chat-Bot made using whisper live, speechbrain and open AI API
Add a description, image, and links to the speechbrain topic page so that developers can more easily learn about it.
To associate your repository with the speechbrain topic, visit your repo's landing page and select "manage topics."