Semantic Car Segmentation with Deep Learning Models

This project explores the application of various deep learning architectures for semantic segmentation of car images. It includes the implementation, training, and evaluation of models like UNet, FPN, LinkNet, and PSPNet, utilizing different backbones and image resolutions to compare their performance.

Project Overview

The primary goal is to accurately segment cars from images, identifying different parts across 5 distinct classes. The project systematically experiments with several state-of-the-art segmentation models to compare their performance on a dedicated car image dataset.

Key activities include:

Loading and preprocessing image and mask data at various resolutions (128x128, 144x144, 256x256).
Augmenting the dataset to improve model generalization.
Implementing and training multiple segmentation models using the segmentation-models library.
Evaluating model performance using metrics like IoU (Intersection over Union) and F1-Score.
Visually comparing prediction results across different architectures, backbones, and image input sizes.

Implemented Models

The following models have been implemented, trained, and compared:

UNet:
- Trained on 128x128 images.
- The trained model is loaded from UNet_segm.hdf5.
FPN (Feature Pyramid Network):
- With vgg16 backbone, trained on 128x128 and 256x256 images.
- With resnet34 backbone, trained on 128x128 images.
- Notebooks: FPN_VGG16_segm_original.ipynb, FPN_segm_aumentato.ipynb, FPN_VGG16_256.ipynb.
LinkNet:
- Employs a resnet34 backbone.
- Trained on the augmented dataset with 128x128 images.
- Implementation can be found in LinkNet.ipynb.
PSPNet (Pyramid Scene Parsing Network):
- Uses a resnet34 backbone.
- Experiments were conducted with different image resolutions: 144x144 (PSPNet_144x144.ipynb) and 192x192 (PSPNet_192x192.ipynb).

Dataset

The project uses a car image dataset with corresponding segmentation masks. The dataset consists of 5 classes.

Data Augmentation

To enhance the dataset and improve model robustness, various data augmentation techniques were applied. The notebook dataAugmentation.ipynb implements the following transformations:

Horizontal Mirroring
Gaussian Noise
Color Jittering
Blur
Random Rotations

Final Results

Final IoU score reached is about 89% We also applied the best model (FPN) to our cars.

Workspace Structure

The repository is organized into several Jupyter notebooks, each dedicated to a specific model or task.

UNet_*.ipynb, FPN_*.ipynb, LinkNet.ipynb, PSPNet_*.ipynb: Notebooks for training specific models.
dataAugmentation.ipynb: Contains the code for data augmentation.
confronto_risultati_dataset.ipynb: Used for loading all trained models and visually comparing their prediction results on the test set.
*.log: Log files generated during model training, containing metrics for each epoch.
*.hdf5: Saved model weights after training.
Esercizi_Lezione: A directory containing notebooks with exercises on fundamental deep learning concepts.

Setup and Dependencies

This project is built using Python and relies on several deep learning and computer vision libraries.

Main Dependencies

TensorFlow / Keras
segmentation-models
OpenCV-Python
Scikit-learn
NumPy
Matplotlib

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
Altre		Altre
Esercizi_Lezione		Esercizi_Lezione
macchine_nostre		macchine_nostre
FPN_VGG16_256.ipynb		FPN_VGG16_256.ipynb
FPN_VGG16_256.log		FPN_VGG16_256.log
FPN_VGG16_original.log		FPN_VGG16_original.log
FPN_VGG16_segm .ipynb		FPN_VGG16_segm .ipynb
FPN_VGG16_segm.log		FPN_VGG16_segm.log
FPN_VGG16_segm_original.ipynb		FPN_VGG16_segm_original.ipynb
FPN_segm.ipynb		FPN_segm.ipynb
FPN_segm.log		FPN_segm.log
FPN_segm_aumentato.ipynb		FPN_segm_aumentato.ipynb
FPN_segm_aumentato.log		FPN_segm_aumentato.log
LinkNet.ipynb		LinkNet.ipynb
LinkNet_128x128.log		LinkNet_128x128.log
PSPNet_144x144.ipynb		PSPNet_144x144.ipynb
PSPNet_192x192.ipynb		PSPNet_192x192.ipynb
PSPNet_192x192.log		PSPNet_192x192.log
PSP_144x144.log		PSP_144x144.log
PSP_prova copy.log		PSP_prova copy.log
Progetto_Deep_Learning.pdf		Progetto_Deep_Learning.pdf
Risultati_ FPN_VGG16_original.ipynb		Risultati_ FPN_VGG16_original.ipynb
Risultati_ FPN_VGG16_segm.ipynb		Risultati_ FPN_VGG16_segm.ipynb
Risultati_ FPN_VGG16_segm_256.ipynb		Risultati_ FPN_VGG16_segm_256.ipynb
Risultati_FPN_aumentato.ipynb		Risultati_FPN_aumentato.ipynb
Risultati_FPN_segm.ipynb		Risultati_FPN_segm.ipynb
Risultati_LinkNet .ipynb		Risultati_LinkNet .ipynb
Risultati_PSPNet_144x144.ipynb		Risultati_PSPNet_144x144.ipynb
Risultati_PSPNet_192x192.ipynb		Risultati_PSPNet_192x192.ipynb
Risultati_UNet1.ipynb		Risultati_UNet1.ipynb
Risultati_UNet_256.ipynb		Risultati_UNet_256.ipynb
Risultati_UNet_segm.ipynb		Risultati_UNet_segm.ipynb
Risultati_UNet_segm_256.ipynb		Risultati_UNet_segm_256.ipynb
Risultati_macchine_nostre.ipynb		Risultati_macchine_nostre.ipynb
UNet1.ipynb		UNet1.ipynb
UNet1.log		UNet1.log
UNet_256.ipynb		UNet_256.ipynb
UNet_256.log		UNet_256.log
UNet_256_p2.log		UNet_256_p2.log
UNet_256_p3.log		UNet_256_p3.log
UNet_segm.ipynb		UNet_segm.ipynb
UNet_segm.log		UNet_segm.log
UNet_segm_256.ipynb		UNet_segm_256.ipynb
UNet_segm_256.log		UNet_segm_256.log
confronto_risultati_dataset.ipynb		confronto_risultati_dataset.ipynb
dataAugmentation.ipynb		dataAugmentation.ipynb
readMe.md		readMe.md
showImages.ipynb		showImages.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic Car Segmentation with Deep Learning Models

Project Overview

Implemented Models

Dataset

Data Augmentation

Final Results

Workspace Structure

Setup and Dependencies

Main Dependencies

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

ele10-code/Deep_Learning

Folders and files

Latest commit

History

Repository files navigation

Semantic Car Segmentation with Deep Learning Models

Project Overview

Implemented Models

Dataset

Data Augmentation

Final Results

Workspace Structure

Setup and Dependencies

Main Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages