Lists (2)
Sort Name ascending (A-Z)
Stars
MIDI / symbolic music tokenizers for Deep Learning models 🎶
State-of-the-art pretrained music models for training, evaluation, inference
My Python scripts to make high-quality figures for publications in top AI conferences and journals.
Midi event transformer for symbolic music generation
A song aesthetic evaluation toolkit trained on SongEval.
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
SGLang is a high-performance serving framework for large language models and multimodal models.
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]
[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"
Simple and readable code for training and sampling from diffusion models
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Unified automatic quality assessment for speech, music, and sound.
Fast and accurate Active SAmpling method for Pairwise comparisons
A project for tri-modal LLM benchmarking and instruction tuning.
Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"
实践番茄工作法:工作时屏蔽浪费时间的网站,休息时允许访问。A Chrome/Edge extension that helps you stay focused by blocking sites during work timers and letting you browse during break timers.
Raspbot V2 AI Vision Robot Car for Raspberry Pi 5
Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.