Skip to content
View Haiwen-Xia's full-sized avatar

Highlights

  • Pro

Block or report Haiwen-Xia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MIDI / symbolic music tokenizers for Deep Learning models 🎶

Python 852 98 Updated Mar 2, 2026

State-of-the-art pretrained music models for training, evaluation, inference

Python 165 16 Updated Jan 20, 2026

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 554 48 Updated Mar 4, 2026
Python 17 3 Updated Jan 24, 2026

Midi event transformer for symbolic music generation

Python 349 54 Updated Dec 31, 2024

A song aesthetic evaluation toolkit trained on SongEval.

Python 286 25 Updated Jun 15, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,359 290 Updated Jan 5, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,215 4,693 Updated Mar 8, 2026

Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".

Python 315 14 Updated Aug 4, 2025

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]

Python 126 4 Updated Feb 6, 2026
Python 120 10 Updated Jan 27, 2025
Python 105 15 Updated Oct 16, 2025

[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"

Python 27 2 Updated Sep 9, 2025

Simple and readable code for training and sampling from diffusion models

Python 714 55 Updated Jun 14, 2025

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook 44 2 Updated Dec 3, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,417 4,778 Updated Jun 2, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 685 49 Updated Jun 5, 2025

MuDiT-MuSiT code

Python 3 Updated Jul 11, 2025

Fast and accurate Active SAmpling method for Pairwise comparisons

MATLAB 54 17 Updated Jul 1, 2024
Python 37 3 Updated Feb 24, 2026

A project for tri-modal LLM benchmarking and instruction tuning.

Python 57 8 Updated Mar 27, 2025

Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"

C++ 43 7 Updated Jul 31, 2021

实践番茄工作法:工作时屏蔽浪费时间的网站,休息时允许访问。A Chrome/Edge extension that helps you stay focused by blocking sites during work timers and letting you browse during break timers.

JavaScript 13 2 Updated Jul 26, 2022

Raspbot V2 AI Vision Robot Car for Raspberry Pi 5

17 12 Updated Sep 10, 2025

a simple vae and cvae from keras

Python 1,381 377 Updated May 18, 2021

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Python 7,498 2,023 Updated Mar 24, 2024

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,837 2,076 Updated Feb 21, 2026
Next