
repositories Search Results · topic:benchmarking org:mbzuai-oryx

[CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the …
  • Python
  • 46
  • Updated on May 26, 2025

Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks
  • Jupyter Notebook
  • 37
  • Updated on Nov 27, 2025