Style Finder: Multimodal Fashion Analysis & Retrieval

This project is a multimodal AI application that analyzes fashion images, finds visually similar outfits, and suggests affordable alternatives from online stores. It combines computer vision, large language models, and web search to deliver an interactive fashion analysis experience.

Features

Image Analysis: Upload a fashion image to extract visual features using a pre-trained ResNet50 model.
Similarity Matching: Finds the closest matching outfit from a curated dataset using cosine similarity.
AI-Powered Description: Uses IBM watsonx Llama 3.2 Vision Instruct model to generate detailed, educational fashion analysis.
Shopping Alternatives: Integrates with SerpAPI to search for similar items online and present affordable alternatives.
Interactive Gradio UI: User-friendly web interface for uploading images, viewing results, and exploring example outfits.

Project Structure

.
├── app.py                  # Main Gradio application
├── config.py               # Configuration settings
├── requirements.txt        # Python dependencies
├── README.md               # Project documentation
├── models/
│   ├── image_processor.py  # Image encoding and similarity logic
│   └── llm_service.py      # LLM (Llama Vision) service integration
├── services/
│   └── search_service.py   # SerpAPI-based product search
├── utils/
│   └── helpers.py          # Utility functions
├── examples/               # Example images for demo
│   ├── test-6.png
│   └── test-7.png
└── swift-style-embeddings.pkl # (Not included) Fashion dataset with embeddings

Setup Instructions

Clone the Repository
```
git clone <repo-url>
cd style-finder
```

Install Dependencies

It is recommended to use a virtual environment.

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Prepare the Dataset
- Place your swift-style-embeddings.pkl file (fashion dataset with precomputed embeddings) in the project root directory.
Configure API Keys
- For full functionality (shopping alternatives), obtain a SerpAPI key.
- Set your SerpAPI key as an environment variable or modify app.py to pass it to StyleFinderApp.
Run the Application
```
python app.py
```
The Gradio interface will launch locally at http://127.0.0.1:5000.

Usage

Upload an Image: Use the Gradio UI to upload a fashion image or select an example.
Analyze Style: Click "Analyze Style" to receive a detailed breakdown and shopping alternatives.
Explore Results: Review the AI-generated analysis and links to similar products.

Customization

Model & API Settings: Adjust model IDs, region, and thresholds in config.py.
Dataset: Replace swift-style-embeddings.pkl with your own dataset for different fashion domains.

Dependencies

See requirements.txt for the full list.

Key packages:

torch, torchvision, pillow, scikit-learn, pandas, numpy
ibm-watsonx-ai (for Llama Vision)
google-search-results (for SerpAPI)
gradio (for the web UI)

License

This project is for educational purposes. See the repository for license details.

Acknowledgments:
Built as part of the IBM RAG Agentic AI Certification course on

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Style Finder: Multimodal Fashion Analysis & Retrieval

Features

Project Structure

Setup Instructions

Usage

Customization

Dependencies

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
examples		examples
models		models
services		services
utils		utils
.gitignore		.gitignore
README.md		README.md
app.py		app.py
config.py		config.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Style Finder: Multimodal Fashion Analysis & Retrieval

Features

Project Structure

Setup Instructions

Usage

Customization

Dependencies

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages