This project uses transformer models to answer questions about an uploaded image. The user interface (UI) is built with Gradio, which provides a simple web interface for interacting with the system.
- Upload an image.
- Ask a question related to the uploaded image.
- The system processes the image and the question together and returns an answer.
- Upload an image through the Gradio UI.
- Type your question about the image.
- The system analyzes both the image and the question, then provides an answer.
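The flow above can be sketched roughly as follows. This is a minimal illustration, not the project's actual `main.py`: the model checkpoint (`dandelin/vilt-b32-finetuned-vqa`) and the helper names `pick_best_answer`, `make_answer_fn`, and `build_demo` are assumptions chosen for the example. It relies on the `visual-question-answering` pipeline from Hugging Face `transformers` and on `gr.Interface` from Gradio.

```python
from typing import Any, Callable, Dict, List

# Assumption: a public VQA checkpoint; the project may use a different model.
MODEL_NAME = "dandelin/vilt-b32-finetuned-vqa"


def pick_best_answer(predictions: List[Dict[str, Any]]) -> str:
    """Return the highest-scoring answer from a list of {'answer', 'score'} dicts."""
    if not predictions:
        return "No answer found."
    best = max(predictions, key=lambda p: p["score"])
    return f"{best['answer']} (score: {best['score']:.2f})"


def make_answer_fn(vqa: Callable[..., List[Dict[str, Any]]]) -> Callable[[Any, str], str]:
    """Wrap a VQA callable into the (image, question) -> text function the UI calls."""

    def answer(image: Any, question: str) -> str:
        # Reject incomplete input before calling the model.
        if image is None or not question or not question.strip():
            return "Please upload an image and enter a question."
        return pick_best_answer(vqa(image=image, question=question.strip()))

    return answer


def build_demo():
    """Hypothetical helper: load the model and wire it to a Gradio interface."""
    # Heavy imports are deferred so the helpers above can be used without them.
    import gradio as gr
    from transformers import pipeline

    vqa = pipeline("visual-question-answering", model=MODEL_NAME)
    return gr.Interface(
        fn=make_answer_fn(vqa),
        inputs=[gr.Image(type="pil", label="Image"), gr.Textbox(label="Question")],
        outputs=gr.Textbox(label="Answer"),
        title="Visual Question Answering",
    )
```

Under these assumptions, calling `build_demo().launch()` would start a local Gradio server with an image upload, a question box, and an answer field. Keeping the answer formatting separate from model loading makes it possible to check the logic with a stub in place of the real pipeline.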
To get started with the project, follow these steps:
- Clone the repository and navigate to the project directory:
git clone https://github.com/arnabsroy9/Visual-Question-Answer.git
cd Visual-Question-Answer
- Install the required dependencies:
pip install -r requirements.txt
To run the app, open a terminal in the Visual-Question-Answer/app directory and execute the following command:
python main.py