Skip to content
View chenin-wang's full-sized avatar

Block or report chenin-wang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Downloads videos and playlists from YouTube

C# 14,333 1,784 Updated Mar 4, 2026

A feature-rich command-line audio/video downloader

Python 149,958 12,152 Updated Mar 3, 2026

A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.

Python 13,496 1,448 Updated Mar 6, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 579 55 Updated Feb 18, 2026
Python 400 42 Updated Feb 20, 2026

Implementing complete SigLIP2 loss components: SILC/TIPS self-distillation, LocCa captioning loss, and Sigmoid loss, following HuggingFace Transformers and SigLIP2 research papers. Open source rese…

Jupyter Notebook 3 Updated Oct 10, 2025

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Python 402 24 Updated Feb 17, 2026

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,099 138 Updated Dec 20, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,868 644 Updated Mar 5, 2026

[arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

Python 62 3 Updated Dec 17, 2025

Code for the Molmo2 Vision-Language Model

Python 361 21 Updated Mar 4, 2026

A version of verl to support diverse tool use

Python 896 74 Updated Mar 2, 2026

Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning

Python 15 1 Updated Feb 8, 2026

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 450 30 Updated Sep 18, 2025

Official Repository of Native Parallel Reasoner

Python 102 18 Updated Feb 5, 2026

A self-learning tutorail for CUDA High Performance Programing.

JavaScript 908 91 Updated Jan 14, 2026

N-dimensional Rotary Position Embeddings for PyTorch

Python 83 3 Updated Feb 14, 2024

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 190 6 Updated Feb 8, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 12,953 1,235 Updated Mar 6, 2026

Fully Open Framework for Democratized Multimodal Training

Python 757 60 Updated Dec 27, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,923 230 Updated Mar 7, 2026

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,515 110 Updated May 27, 2025
Python 718 20 Updated Feb 5, 2026

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,282 841 Updated Dec 22, 2025

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 698 26 Updated Sep 24, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,528 1,651 Updated Jan 30, 2026

Simple & Scalable Pretraining for Neural Architecture Research

Python 309 31 Updated Dec 6, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,681 360 Updated Feb 26, 2026

Awesome Unified Multimodal Models

1,134 36 Updated Feb 6, 2026

Open-source unified multimodal model

Python 5,719 504 Updated Oct 27, 2025
Next