[CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
-
Updated
Sep 19, 2025 - Python
[CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
[AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.
[NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts
视频课程学习工作台:字幕提取、课程管理、AI 学习材料与 ASR 校正。Video course workspace for subtitles, course collections, AI study notes, and ASR correction.
Learn Arabic and German by learning the vocabulary to understand native speaker videos.
AI Teaching Assistant that summarizes lectures, answers questions, generates slides, and creates quizzes from audio/video lectures.
Add a description, image, and links to the video-learning topic page so that developers can more easily learn about it.
To associate your repository with the video-learning topic, visit your repo's landing page and select "manage topics."