Skip to content
#

torchaudio

Here are 60 public repositories matching this topic...

Cascade is a production-ready, high-performance, and low-latency audio stream processing library designed for Voice Activity Detection (VAD). Built upon the excellent Silero VAD model, Cascade significantly reduces VAD processing latency while maintaining high accuracy through its 1:1:1 binding architecture and asynchronous streaming technology.

  • Updated Dec 22, 2025
  • Python
Qwen3-TTS-Daggr-UI

Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI nodes. Supports voice design (prompt-to-speech), voice cloning (zero-shot), and custom voice synthesis with multiple speakers and languages. Features lazy model loading to optimize memory, multi-model sizes (0.6B and 1.7B), ASR and support for various audio inputs.

  • Updated Feb 12, 2026
  • Python

Improve this page

Add a description, image, and links to the torchaudio topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the torchaudio topic, visit your repo's landing page and select "manage topics."

Learn more