Lists (1)
Sort Name ascending (A-Z)
Stars
Speech To Speech: an effort for an open-sourced and modular GPT4-o
VoiceBench: Benchmarking LLM-Based Voice Assistants
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Implementation of the SSR algorithm of the paper "LLMs Reproduce Human Purchase Intent via Semantic Similarity Elicitation of Likert Ratings"
practice made claude perfect
This is the official repository for DiPro, highlighted as a Spotlight at NeurIPS 2025.
Train your AI self, amplify you, bridge the world
This is an AI Agent based on FFmpeg. You just need to say a sentence, and it will help you process media files.
CLEF: Clinically-Guided Contrastive Learning for Electrocardiogram Foundation Models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing
Plot standard multi lead ECG/EKG chart with Python
Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取
Reference PyTorch implementation and models for DINOv3
🤗 smolagents: a barebones library for agents that think in code.
Generate PA through waveform reconstruction using the VAE.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
XLeRobot: Practical Dual-Arm Mobile Home Robot for $660
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Resource library for getting started with deep learning work using electrocardiograms
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
Official implementation of "MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation"
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases