Skip to content

yahooo-m/Streaming-video-understanding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

7 Commits
Β 
Β 

Repository files navigation

🎬 Streaming Video Understanding Papers

πŸ“š A Curated Collection of Research Papers on Streaming, Online, and Real-time Video Understanding with LMMs

Awesome Papers Updated

Focus Areas: Streaming Perception β€’ Proactive QA β€’ Real-time Memory β€’ KV Compression β€’ Token-efficient Long Video Modeling


πŸ“– Table of Contents


πŸš€ Streaming Video LLMs

πŸ† Model πŸ“… Year πŸ“„ Paper πŸ’» Code 🌟 Highlights
Streamo 2025 πŸ“„ arXiv - Latest streaming model
StreamingVLM 2025 πŸ“„ arXiv - Advanced streaming architecture
Flash-VStream 2024 πŸ“„ arXiv - Flash attention for streaming
StreamForest 2025 πŸ“„ arXiv - Hierarchical streaming
LiveVLM 2025 πŸ“„ arXiv - Real-time vision-language
VideoChat-Online 2024 πŸ“„ arXiv - Conversational streaming
EyeWO 2024 - πŸ’» GitHub Eyes Wide Open framework
StreamChat 2025 πŸ“„ arXiv - Interactive streaming chat
StreamBridge 2025 πŸ“„ arXiv - Bridging streaming gaps
VITA 1.5 2025 - πŸ’» GitHub Multimodal streaming
CogStream 2025 πŸ“„ arXiv - Cognitive streaming

⚑ Token & KV Compression

πŸ”₯ KV Cache Compression

πŸ† Method πŸ“… Year πŸ“„ Paper πŸ’» Code 🎯 Core Technique
ReKV 2025 πŸ“„ arXiv - Recursive KV caching
StreamKV 2025 πŸ“„ arXiv - Streaming KV management
InfiniPot-V 2025 πŸ“„ arXiv - Infinite potential vision
StreamMem 2025 πŸ“„ arXiv - Streaming memory system
InfiniteVL 2025 - πŸ’» GitHub Infinite vision-language

🎨 Token / Visual Compression

πŸ† Method πŸ“… Year πŸ“„ Paper πŸ’» Code 🎯 Core Technique
TimeChat-Online (DTD) 2025 πŸ“„ arXiv - Differential Token Drop
VideoLLM-MoD 2024 πŸ“„ arXiv - Mixture of Depths
StreamingTOM 2025 πŸ“„ arXiv - Token-level optimization
STC 2025 πŸ“„ arXiv - Spatial-temporal compression

πŸ—£οΈ Proactive QA Systems

πŸ€– Online / Real-Time / Proactive Output

πŸ† System πŸ“… Year πŸ“„ Paper πŸ’» Code 🎯 Innovation
VideoLLM-online 2024 πŸ“„ arXiv - First online VideoLLM
MMDuet 2024 πŸ“„ arXiv - Dual-mode interaction
MMDuet 2 2025 πŸ“„ OpenReview - Enhanced dual-mode
Dispider 2025 πŸ“„ arXiv - Distributed processing
StreamMind 2025 πŸ“„ arXiv - Cognitive streaming
TimeChat-Online 2025 πŸ“„ arXiv - Temporal understanding
LiveStar 2025 πŸ“„ arXiv - Live streaming star
StreamVLN 2025 πŸ“„ arXiv - Vision-language navigation
LION-FS 2025 πŸ“„ arXiv - Few-shot learning
ROMA 2025 πŸ“„ arXiv - Omni-Multimodal Assistant

πŸ“Š Benchmarks & Datasets

🎯 Streaming / Online Video Understanding Benchmarks

πŸ† Benchmark πŸ“… Year πŸ“„ Paper πŸ’» Code 🎯 Focus Area
OVO-Bench 2025 - πŸ’» GitHub Online video understanding
StreamingBench 2025 πŸ“„ arXiv - Comprehensive streaming eval
OmniMMI 2025 πŸ“„ arXiv - Omni-modal interaction
RTV-Bench 2025 πŸ“„ arXiv - Real-time video benchmark
VStream-QA (RVS) 2024 πŸ“„ arXiv - Video stream QA
StreamBench 2025 - πŸ’» GitHub Streaming benchmark
StreamingCoT 2025 πŸ“„ arXiv - Chain-of-thought streaming
TV-Online 2025 πŸ“„ OpenReview - TV video understanding
ProactiveVideoQA 2025 πŸ“„ arXiv - Proactive QA
SVBench 2025 πŸ“„ arXiv - Streaming video benchmark
OSTBench 2025 - πŸ’» GitHub Online streaming tasks
StreamEQA 2025 - πŸ’» GitHub Embodied QA streaming

πŸ“ˆ Statistics

Category Count Latest Year
πŸš€ Streaming Video LLMs 11 2025
⚑ KV Cache Compression 5 2025
🎨 Token Compression 4 2025
πŸ—£οΈ Proactive QA 9 2025
πŸ“Š Benchmarks 12 2025
Total Papers 41 2025

🀝 Contributing

We welcome contributions! Please feel free to:

  • πŸ“ Submit new papers via Pull Request
  • πŸ› Report issues or suggest improvements via Issues
  • ⭐ Star this repository if you find it helpful!

Contribution Guidelines

  1. Ensure the paper is related to streaming/online video understanding
  2. Provide paper link (arXiv, OpenReview, etc.) or code repository
  3. Include a brief description of the core contribution
  4. Follow the existing table format

πŸ“§ Contact & Collaboration

Feel free to reach out for collaborations or discussions!

Last Updated: December 2025

Made with ❀️ by the Streaming Video Understanding Community

About

Awesome streaming video understanding

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors