maksimstw

Taiwei Shi maksimstw

verl-project/verl Public

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20.6k 3.6k
microsoft/experiential_rl Public

The official codebase for "Experiential Reinforcement Learning" - https://arxiv.org/pdf/2602.13949v1

Python 62 5
rllm-org/rllm Public

Democratizing Reinforcement Learning for LLMs

Python 5.4k 539
hiyouga/LlamaFactory Public

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69.9k 8.5k
BytedTsinghua-SIA/MemAgent Public

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 984 69
limenlp/safer-instruct Public

This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"

17 1