Suicide Rate Prediction

A comparative machine-learning study to forecast national suicide rates using nine regression algorithms, Genetic Algorithm–based feature selection, and Principal Component Analysis on both public and custom-curated datasets.

Introduction

Suicide-rate forecasting can inform timely public health interventions. This project evaluates nine regression models—KNN, Random Forest, Decision Tree, MLP, Linear Regression, Ridge Regression, and SVR (linear, polynomial, RBF)—under three preprocessing regimes:

Baseline (no feature manipulation)
Genetic Algorithm–based feature selection
Principal Component Analysis (PCA)

Experiments are performed on:

A publicly available Kaggle dataset (1985–2021) with feature pruning due to missingness.
A custom dataset (~84 000 records) merged from WHO, IHME, World Bank, and national sources for richer socio-demographic indicators.

Features

Wrapper-based feature selection via a Genetic Algorithm (GAFeatureSelectionCV)
Dimensionality reduction via PCA retaining 95 % variance
Nine regression algorithms implemented in scikit-learn
Nested cross-validation to prevent data leakage
Comprehensive performance metrics (MAE, MSE, RMSE)

Datasets

Public Kaggle Dataset (first_dataset/)
- 31 756 records, 12 original columns
- HDI, population, and raw counts dropped due to missingness or leakage
Custom-Curated Dataset (second_dataset/)
- ~84 000 records, 10 curated features
- Integrated from WHO, IHME (GBD 2021), World Bank, and Statbank Greenland

Note: Raw and preprocessed CSVs are stored in their respective folders.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
compressed		compressed
first_dataset		first_dataset
second_dataset		second_dataset
Presentation.pdf		Presentation.pdf
README.md		README.md
Suicide Rates Prediction.pdf		Suicide Rates Prediction.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Suicide Rate Prediction

Introduction

Features

Datasets

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Suicide Rate Prediction

Introduction

Features

Datasets

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages