Skip to content
View waltstephen's full-sized avatar

Highlights

  • Pro

Block or report waltstephen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
waltstephen/README.md

Hi there, I'm Yijia Fan (WaltStephen)! 👋

Current Status: Research Intern at MSRA & Undergraduate at SYSU

Welcome to my GitHub profile! I am a Junior Computer Science student at Sun Yat-sen University (SYSU) and a Research Intern at Microsoft Research Asia (MSRA).

My long-term goal is to build a unified world model that seamlessly integrates understanding and generation across modalities and agents.

📢 I am seeking PhD opportunities for Fall 2027. If you are interested, please contact me directly!


🎓 About Me

  • Education: B.Eng. in Computer Science, Sun Yat-sen University (Sept 2023 - Present).
  • Research Labs:
    • Microsoft Research Asia (MSRA), Shanghai ML Group.
    • HCP Lab, Sun Yat-sen University.
  • Role: Reviewer for CVPR 2026, AAAI 2026, ICLR 2026.
  • Interests beyond tech: Philosophy (Kant, Nietzsche), Literature (Tolstoy, Kafka), and the intersection of humanity and technology. 📚🤔

🔬 Research Interests

My research centers on advancing multi-agent collaborative intelligence by integrating Visual Language Models (VLMs) and Large Language Models (LLMs) with Reinforcement Learning (RL).

  • Multi-Agent Systems: Knowledge-aware coordination, Bayesian bandits, Game-theoretic uncertainty trading.
  • Policy Optimization: PPO, MAB, GRPO, and MAPPO-like RL frameworks.
  • Generative Models: Video generation (Hierarchical VAE), 3D generation, and Unified World Models.
  • VLM Architectures: Discretized representations (VQ), Post-training with RFT (LLaVA).

📝 Selected Publications

Check out my Google Scholar for the full list.

  • [AAAI 2026] Cost-Effective Communication: An Auction-based Method for Language Agent Interaction
    Yijia Fan, Jusheng Zhang, Kaitong Cai, et al.
  • [AAAI 2026] 3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment
    Yijia Fan, Jusheng Zhang, Kaitong Cai, et al.
  • [NeurIPS 2025] GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning
    Jusheng Zhang, Yijia Fan, et al.
  • [NeurIPS 2025] Tri-MARF: A Tri-Modal Multi-Agent Responsive Framework for Comprehensive 3D Object Annotation
    Jusheng Zhang, Yijia Fan, et al.
  • [EMNLP 2025] CCG: Rare-Label Prediction via Neural SEM–Driven Causal Game
    Yijia Fan, Jusheng Zhang, et al.

💼 Experience Highlights

  • Microsoft Research Asia (Shanghai ML Group) | July 2025 – Present
    • Improving video generation using hierarchical VAE and exploring next-gen discretized VLM projects.
    • Training Unified video generation/understanding models on multi-node clusters (DeepSpeed).
  • HCP Lab (SYSU) | July 2024 – Present
    • Exploring long-context LLM processing via diffusion models and curiosity-based game systems for few-label classification.

🛠️ Skills & Tools

  • Languages: Python (PyTorch), C/C++. 🐍💻
  • Frameworks: PyTorch, CUDA, DeepSpeed, PyTorch Lightning. ⚡
  • Tools: Linux, LaTeX, Git. 🐧📄

Top Languages

🌟 Let’s Connect!

Whether you’re into AI research, philosophy, or literature, feel free to reach out!

📫 Email: fanyj28@mail2.sysu.edu.cn
📍 Location: Guangzhou / Shanghai, China

Popular repositories Loading

  1. GLUE_baseline_pytorch GLUE_baseline_pytorch Public

    GELU_baseline is built based on a higher version of pytorch and transformers library, so you no longer need to manually download the dataset

    Python 6

  2. ArxivTrackerAssistant ArxivTrackerAssistant Public

    这是一个简单的由python构建的arxiv论文搜索小程序,你可以搜索特定人/单位的最近20篇文章,并调用geimini-2.0-flash快速总结,可视化在mac上已经成功进行

    Python 2

  3. waltstephen waltstephen Public

    1

  4. awesome-kan awesome-kan Public

    Forked from mintisan/awesome-kan

    A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…

    1

  5. YatPotato YatPotato Public

    Forked from ouyangyipeng/YatPotato

    A Tomato Clock with multiple interesting functions.

    JavaScript

  6. KABB KABB Public

    Forked from HCP-AI-Research-Lab/KABB

    [ICML2025] KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems

    Python