Skip to content
Change the repository type filter

All

    Repositories list

    • SWE-World

      Public
      HTML
      MIT License
      13110Updated Mar 6, 2026Mar 6, 2026
    • HTML
      MIT License
      24440Updated Feb 28, 2026Feb 28, 2026
    • PTS

      Public
      [ICLR'26] The official GitHub page for ''Unleashing Perception-Time Scaling to Multimodal Reasoning Models''
      MIT License
      1000Updated Feb 28, 2026Feb 28, 2026
    • RecPilot

      Public
      Python
      1300Updated Feb 26, 2026Feb 26, 2026
    • DxEvolve

      Public
      Python
      MIT License
      0100Updated Feb 25, 2026Feb 25, 2026
    • GenCI

      Public
      This is the official PyTorch implementation for the paper
      Python
      MIT License
      1000Updated Jan 23, 2026Jan 23, 2026
    • VIPER

      Public
      The official GitHub page for ''Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning''
      MIT License
      1001Updated Jan 15, 2026Jan 15, 2026
    • Revisiting-Visual-CoT

      Public
      1400Updated Dec 26, 2025Dec 26, 2025
    • CIR

      Public
      Python
      11400Updated Nov 11, 2025Nov 11, 2025
    • POPEv2

      Public
      [AAAI'26 Oral] The official GitHub page for ''Analyzing and Mitigating Object Hallucination: A Training Bias Perspective''
      MIT License
      1000Updated Nov 10, 2025Nov 10, 2025
    • DEPO

      Public
      About The official GitHub page for ''Towards High Data Efficiency in Reinforcement Learning with Verifiable Reward'' Resources
      Python
      0000Updated Sep 25, 2025Sep 25, 2025
    • CAFE

      Public
      A novel two-stage coarse-to-fine information-seeking method to enhance the multi-document question-answering capabilities of LLMs.
      Apache License 2.0
      0410Updated Sep 5, 2025Sep 5, 2025
    • A test-time scaling framework that coordinates three collaborative LRMs to iteratively explore and refine solutions guided by historical attempts.
      Apache License 2.0
      0110Updated Sep 2, 2025Sep 2, 2025
    • POPE

      Public
      The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''
      Python
      MIT License
      1525300Updated Aug 21, 2025Aug 21, 2025
    • The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
      Python
      611280Updated Aug 15, 2025Aug 15, 2025
    • Slow_Thinking_with_LLMs

      Public
      A series of technical report on Slow Thinking with LLM
      Python
      41761161Updated Aug 13, 2025Aug 13, 2025
    • MMATH

      Public
      Python
      MIT License
      2310Updated Aug 8, 2025Aug 8, 2025
    • R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
      Python
      MIT License
      47695192Updated Aug 5, 2025Aug 5, 2025
    • MTGRec

      Public
      Python
      11500Updated Jul 24, 2025Jul 24, 2025
    • Python
      1000Updated Jun 29, 2025Jun 29, 2025
    • LLMBox

      Public
      A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
      Python
      MIT License
      10685204Updated Jun 16, 2025Jun 16, 2025
    • UTGRec

      Public
      Python
      2810Updated Jun 11, 2025Jun 11, 2025
    • SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
      Python
      MIT License
      811810Updated Jun 3, 2025Jun 3, 2025
    • RioRAG

      Public
      Python
      1200Updated Jun 2, 2025Jun 2, 2025
    • OlymMATH

      Public
      The OlymMATH dataset
      Python
      MIT License
      02400Updated Jun 1, 2025Jun 1, 2025
    • Python
      23310Updated May 27, 2025May 27, 2025
    • Virgo

      Public
      Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
      Python
      410900Updated May 27, 2025May 27, 2025
    • DeepRec

      Public
      Python
      0600Updated May 26, 2025May 26, 2025
    • R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
      Python
      MIT License
      27550Updated May 25, 2025May 25, 2025
    • CCFRec

      Public
      [KDD'25] Code of "Bridging Textual-Collaborative Gap through Semantic Codes for Sequential Recommendation".
      Python
      MIT License
      4800Updated May 25, 2025May 25, 2025