Team: Topple my Tiger

CODS-COMAD Data Challenge (Sponsored by Meesho)

Approach Brief

How to run the code

The code is divided into three main pipelines Preprocessor, Training, and Inferencing.
Each pipeline is run for each category of the products, and creates the artefacts in their respective output folders.

Setup

Create aand activate a virtual environment as follow:

    python3 -m venv cods-venv && source cods-venv/bin/activate

Install all the dependencies

    pip install -r ./requirements.txt

downlaod all the pre-finetuned models from the following google drive link : Download Models here and extract the archive into the ./Models folder.
alternativelly use gdown as follow:

    pip install gdown && \
    gdown --folder https://drive.google.com/drive/folders/1SrynpY35WIkubQekjgfLw4ZuDYf--UYI?usp=sharing -O ./Models

Download dataset from Kaggle and unzip the dataset into the ./Dataset folder.
Alternatively use Kaggle CLI as follow:

    kaggle competitions download -c visual-taxonomy -p ./Dataset

Running the preprocessor

To run the preprocessor, run the individual .ipynb notebooks for each category.
alternatively use the following one-liner from bash shell to run all the preprocessor notebooks at once (Please ensure adequate RAM and CPU power are present in machine to handle all the notebooks at once).

    cd Preprocessor-FillNA && \
    (trap "pkill -P $$" SIGINT; for notebook in *.ipynb; do jupyter nbconvert --to notebook --execute "$notebook" --output "${notebook%.ipynb}_executed.ipynb" & done; wait)

Running training of models

Ensure you have ran the preprocessor code and the outputs are generated in ./Preprocessor-FillNA/output.
Run individual .ipynb notebooks for each category.
The code will save the models in the ./Models/<category> folder for each category.

Running inference

To run inference, please ensure all the models are downloaded from google drive and placed in the correct folder.
Run the individual .ipynb notebooks for each category to generate the output in the output folder and respective category folders too.
One liner to run all the .ipynb notebooks from command line :

    cd Inferencing && \
    (trap "pkill -P $$" SIGINT; find . -type f -name "*.ipynb" | while read notebook; do jupyter nbconvert --to notebook --execute "$notebook" --output "${notebook%.ipynb}_executed.ipynb" & done; wait)

Building the submission file

To create the final submission file, run Submission_File_Prep.ipynb this will create the submission.csv file in the root folder.
To run the same from command line please use the following:

     jupyter nbconvert --to notebook --execute ./Submission_File_Prep.ipynb --output Submission_File_Prep_executed.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Team: Topple my Tiger

CODS-COMAD Data Challenge (Sponsored by Meesho)

Approach Brief

How to run the code

Setup

Running the preprocessor

Running training of models

Running inference

Building the submission file

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Dataset		Dataset
Inferencing		Inferencing
Models		Models
Preprocessor-FillNA		Preprocessor-FillNA
Training		Training
.gitignore		.gitignore
Readme.md		Readme.md
Submission_File_Prep.ipynb		Submission_File_Prep.ipynb
requirments.txt		requirments.txt
submission.csv		submission.csv

Folders and files

Latest commit

History

Repository files navigation

Team: Topple my Tiger

CODS-COMAD Data Challenge (Sponsored by Meesho)

Approach Brief

How to run the code

Setup

Running the preprocessor

Running training of models

Running inference

Building the submission file

About

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages