The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,359 290 Updated Jan 5, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,215 4,693 Updated Mar 8, 2026

tencent-ailab / MuQ

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 315 14 Updated Aug 4, 2025

TIGER-AI-Lab / EditReward

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]

Python 126 4 Updated Feb 6, 2026

salesforce / HIVE

Python 120 10 Updated Jan 27, 2025

ASLP-lab / SongFormer

Python 105 15 Updated Oct 16, 2025

xxayt / MGSV

[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"

Python 27 2 Updated Sep 9, 2025

yuanchenyang / smalldiffusion

Simple and readable code for training and sampling from diffusion models

Python 714 55 Updated Jun 14, 2025

mulab-mir / muchomusic

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook 44 2 Updated Dec 3, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,417 4,778 Updated Jun 2, 2025

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 685 49 Updated Jun 5, 2025

CarlWangChina / MuDiT-MuSiT

MuDiT-MuSiT code

Python 3 Updated Jul 11, 2025

gfxdisp / asap

Fast and accurate Active SAmpling method for Pairwise comparisons

MATLAB 54 17 Updated Jul 1, 2024

gclef-cmu / music-arena

Python 37 3 Updated Feb 24, 2026

multimodal-art-projection / OmniBench

A project for tri-modal LLM benchmarking and instruction tuning.

Python 57 8 Updated Mar 27, 2025

HanqingWangAI / SSM-VLN

Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"

C++ 43 7 Updated Jul 31, 2021

i207M / Pomodoro-Improved-Strict-Workflow

Practice the Pomodoro Technique: block time-wasting websites while you work and allow access during breaks. A Chrome/Edge extension that helps you stay focused by blocking sites during work timers and letting you browse during break timers.

JavaScript 13 2 Updated Jul 26, 2022

YahboomTechnology / Raspbot-V2

Raspbot V2 AI Vision Robot Car for Raspberry Pi 5

17 12 Updated Sep 10, 2025

bojone / vae

a simple vae and cvae from keras

Python 1,381 377 Updated May 18, 2021

wiseodd / generative-models

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Python 7,498 2,023 Updated Mar 24, 2024

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,837 2,076 Updated Feb 21, 2026

bytedance / music_source_separation

Python 1,381 200 Updated Apr 18, 2024

Haiwen Xia Haiwen-Xia

Lists (2)

2025_Spring

Deep Learning

Stars

Natooz / MidiTok

a43992899 / MARBLE

ChenLiu-1996 / figures4papers

qiuqiangkong / piano_transcription

christianazinn / MIDI-RWKV

qiuqiangkong / audio_understanding

SkyTNT / midi-model

ASLP-lab / SongEval

facebookresearch / sam-audio