This repository contains a data analysis and visualization project using Python (Pandas) and Tableau. It explores sales trends, product availability, and key insights from an eCommerce dataset. The project includes data cleaning, EDA, and interactive dashboards for better business insights.
This project focuses on analyzing sales data using Python (Pandas) for data cleaning and Tableau for visualization. The dataset includes product details, availability, pricing, and sales trends.
-
Data Cleaning & Preprocessing (Handled in Jupyter Notebook / Google Colab)
- Removed null values
- Converted
lastUpdatedcolumn to datetime format - Processed price values and extracted useful insights
-
Exploratory Data Analysis (EDA)
- Checked data distributions
- Identified trends in product availability and sales
-
Visualization in Tableau
- Sales Trend Over Time (Line Chart)
- Top-Selling Products (Bar Chart)
- Availability vs. Sales Performance (Heatmap)
- Geographical Sales Insights (Map Visualization)
-
Dashboard Creation
- Combined multiple visualizations into an interactive Tableau dashboard
- The dataset and cleaned version are provided in the
data/folder - Python scripts for data preprocessing are in the
notebooks/folder - Tableau workbook (.twbx) file is included for visualization
🔹 Python (Pandas, Matplotlib, Seaborn) – Data Cleaning & Analysis
🔹 Tableau – Data Visualization
🔹 Git & GitHub – Version Control
To run the Jupyter notebook:
git clone https://github.com/YOUR_GITHUB_USERNAME/YOUR_REPO.git
cd YOUR_REPO
pip install pandas matplotlib seaborn
jupyter notebook