RUCAIBox

All

180 repositories

SWE-World
Public
HTML
•
MIT License
•1•31•1•0•Updated Mar 6, 2026Mar 6, 2026
SWE-Master
Public
HTML
•
MIT License
•2•44•4•0•Updated Feb 28, 2026Feb 28, 2026
PTS
Public
[ICLR'26] The official GitHub page for ''Unleashing Perception-Time Scaling to Multimodal Reasoning Models''
MIT License
•1•0•0•0•Updated Feb 28, 2026Feb 28, 2026
RecPilot
Public
Python
•1•3•0•0•Updated Feb 26, 2026Feb 26, 2026
DxEvolve
Public
Python
•
MIT License
•0•1•0•0•Updated Feb 25, 2026Feb 25, 2026
GenCI
Public
This is the official PyTorch implementation for the paper
Python
•
MIT License
•1•0•0•0•Updated Jan 23, 2026Jan 23, 2026
VIPER
Public
The official GitHub page for ''Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning''
MIT License
•1•0•0•1•Updated Jan 15, 2026Jan 15, 2026
Revisiting-Visual-CoT
Public
1•4•0•0•Updated Dec 26, 2025Dec 26, 2025
CIR
Public
Python
•1•14•0•0•Updated Nov 11, 2025Nov 11, 2025
POPEv2
Public
[AAAI'26 Oral] The official GitHub page for ''Analyzing and Mitigating Object Hallucination: A Training Bias Perspective''
MIT License
•1•0•0•0•Updated Nov 10, 2025Nov 10, 2025
DEPO
Public
About The official GitHub page for ''Towards High Data Efficiency in Reinforcement Learning with Verifiable Reward'' Resources
Python
•0•0•0•0•Updated Sep 25, 2025Sep 25, 2025
CAFE
Public
A novel two-stage coarse-to-fine information-seeking method to enhance the multi-document question-answering capabilities of LLMs.
Apache License 2.0
•0•4•1•0•Updated Sep 5, 2025Sep 5, 2025
Sticker-TTS
Public
A test-time scaling framework that coordinates three collaborative LRMs to iteratively explore and refine solutions guided by historical attempts.
Apache License 2.0
•0•1•1•0•Updated Sep 2, 2025Sep 2, 2025
POPE
Public
The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
Python
•
MIT License
•15•253•0•0•Updated Aug 21, 2025Aug 21, 2025
Passk_Training
Public
The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
Python
•6•112•8•0•Updated Aug 15, 2025Aug 15, 2025
Slow_Thinking_with_LLMs
Public
A series of technical report on Slow Thinking with LLM
Python
•41•761•16•1•Updated Aug 13, 2025Aug 13, 2025
MMATH
Public
Python
•
MIT License
•2•3•1•0•Updated Aug 8, 2025Aug 8, 2025
R1-Searcher
Public
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Python
•
MIT License
•47•695•19•2•Updated Aug 5, 2025Aug 5, 2025
MTGRec
Public
Python
•1•15•0•0•Updated Jul 24, 2025Jul 24, 2025
Inference-Efficiency-Evaluation
Public
Python
•1•0•0•0•Updated Jun 29, 2025Jun 29, 2025
LLMBox
Public
A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
Python
•
MIT License
•106•852•0•4•Updated Jun 16, 2025Jun 16, 2025
UTGRec
Public
Python
•2•8•1•0•Updated Jun 11, 2025Jun 11, 2025
SimpleDeepSearcher
Public
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
Python
•
MIT License
•8•118•1•0•Updated Jun 3, 2025Jun 3, 2025
RioRAG
Public
Python
•1•2•0•0•Updated Jun 2, 2025Jun 2, 2025
OlymMATH
Public
The OlymMATH dataset
Python
•
MIT License
•0•24•0•0•Updated Jun 1, 2025Jun 1, 2025
ManuSearch
Public
Python
•2•33•1•0•Updated May 27, 2025May 27, 2025
Virgo
Public
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
Python
•4•109•0•0•Updated May 27, 2025May 27, 2025
DeepRec
Public
Python
•0•6•0•0•Updated May 26, 2025May 26, 2025
R1-Searcher-plus
Public
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
Python
•
MIT License
•2•75•5•0•Updated May 25, 2025May 25, 2025
CCFRec
Public
[KDD'25] Code of "Bridging Textual-Collaborative Gap through Semantic Codes for Sequential Recommendation".
Python
•
MIT License
•4•8•0•0•Updated May 25, 2025May 25, 2025