-
University of Chinese Academy of Sciences
- 北京
- liuchengwei19@mails.ucas.ac.cn
-
MOSS-Audio-Tokenizer Public
Forked from OpenMOSS/MOSS-Audio-TokenizerMOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …
Python Apache License 2.0 UpdatedFeb 28, 2026 -
Qwen3-ASR Public
Forked from QwenLM/Qwen3-ASRQwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
Python Apache License 2.0 UpdatedJan 30, 2026 -
Qwen3-TTS Public
Forked from QwenLM/Qwen3-TTSQwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Python Apache License 2.0 UpdatedJan 22, 2026 -
-
Fun-Audio-Chat Public
Forked from FunAudioLLM/Fun-Audio-ChatFun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
Python Apache License 2.0 UpdatedDec 25, 2025 -
sam-audio Public
Forked from facebookresearch/sam-audioThe repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Python Other UpdatedDec 23, 2025 -
VibeVoice Public
Forked from microsoft/VibeVoiceOpen-Source Frontier Voice AI
Python MIT License UpdatedDec 17, 2025 -
Foundations-of-LLMs Public
Forked from ZJU-LLMs/Foundations-of-LLMsA book for Learning the Foundations of LLMs
Other UpdatedDec 12, 2025 -
unified-audio Public
Forked from alibaba/unified-audioAn Open-Source Project to Unify Audio Processing and Generation
Apache License 2.0 UpdatedOct 23, 2025 -
Qwen3 Public
Forked from QwenLM/Qwen3Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Python UpdatedOct 13, 2025 -
-
-
cgmm_mvdr_online Public
Forked from BUTSpeechFIT/cgmm_mvdr_onlineImplementation of CGMM-MVDR beamforming used for Clarity challenge