-
Peking University
- Beijing
- @jiaxun71762860
Lists (18)
Sort Name ascending (A-Z)
Stars
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
slime is an LLM post-training framework for RL Scaling.
verl: Volcano Engine Reinforcement Learning for LLMs
[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
Tongyi Deep Research, the Leading Open-source Deep Research Agent
The Next Step Forward in Multimodal LLM Alignment
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
《动手学大模型Dive into LLMs》系列编程实践教程
Code for the paper "Evaluating Large Language Models Trained on Code"
Create beautiful, publication-quality books and documents from computational content.
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…
[ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incen…
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
✨✨Latest Advances on Multimodal Large Language Models
SGLang is a high-performance serving framework for large language models and multimodal models.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfac…
Collect every awesome work about r1!
Solve Visual Understanding with Reinforced VLMs
Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
A library for efficient similarity search and clustering of dense vectors.


