Skip to content
View CodeGoat24's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report CodeGoat24

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling

53 Updated Mar 2, 2026
Python 140 4 Updated Feb 13, 2026

Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"

Python 260 26 Updated Jan 20, 2026

Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.

Python 52 Updated Jan 7, 2026

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]

Python 23 Updated Dec 10, 2025

Rethinking Semantic-level Building Change Detection: Ensemble Learning and Dynamic Interaction

Python 16 Updated Jan 9, 2026

Official implementation of RLFR: Extending Reinforcement Learning for LLMs with Flow Environment

Python 46 1 Updated Nov 15, 2025

[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models

Python 50 Updated Feb 21, 2026

[NeurIPS 2025] Fractional Langevin Dynamics for Combinatorial Optimization via Polynomial-Time Escape

10 1 Updated Sep 30, 2025

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 190 6 Updated Feb 8, 2026

[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"

Python 182 10 Updated Feb 4, 2026

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

294 24 Updated Jan 7, 2026

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 945 58 Updated Dec 27, 2025
Python 15 1 Updated Nov 18, 2025

Webpage for dicache

JavaScript 2 Updated Oct 4, 2025

A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)

439 30 Updated Oct 3, 2025

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 240 11 Updated Feb 10, 2026

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Python 123 3 Updated Mar 2, 2026

[ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache

Python 55 2 Updated Jan 26, 2026

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

Python 231 25 Updated Aug 7, 2025

(ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’

Python 59 2 Updated Jan 26, 2026

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Python 187 6 Updated Nov 6, 2025

Official implementation of UnifiedReward & UnifiedReward-Think

Python 18 16 Updated Jun 18, 2025

Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding

HTML 6 2 Updated Aug 2, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,682 360 Updated Feb 26, 2026

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 835 42 Updated Dec 14, 2025

Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"

Python 45 2 Updated Apr 3, 2025

From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning - CVPR 2025

Python 16 3 Updated Mar 24, 2025

[ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Python 506 33 Updated Oct 25, 2025

We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, learn causal world knowledge, and tackle complex tasks through…

JavaScript 27 2 Updated Apr 7, 2025
Next