This repository is for implementing large multimodal models (LMMs) to perform temporal action localization (TAL).
Currently, this repo supports the following:
- [Model] Gemini; GPT4;
- [Dataset] THUMOS14; FienAction;
- Fill api key in the files
api_key/{gemini | openai}.txt - Download datasets