repositories Search Results · topic:foundation-models org:mbzuai-oryx
Filter by
0 results
(182 ms)0 results
inmbzuai-oryx (press backspace or delete to remove)[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that…
- Python
- 945
- Updated on Aug 5, 2025
GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …
- Python
- 144
- Updated on May 28, 2025
[CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
- Python
- 97
- Updated on Apr 14, 2025
Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks
- Jupyter Notebook
- 37
- Updated on Nov 27, 2025

Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.
Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.