- 🔭 Built end-to-end analytics systems in compliance, operations, and finance
- 🤝 Looking to collaborate on data analysis, business intelligence, SQL, or dashboard projects
- 📍 Open to Data Analyst roles
- 💬 Ask me about SQL window functions, anomaly detection, Power BI, Tableau, Python for data analysis
- ⚡ Fun fact: I built a 3-layer GST fraud detection system that flags 15% of 50,000 invoices — using pure SQL and statistics, no ML
Languages
Data & Analytics Libraries
Databases
BI & Visualization
Tools
End-to-end analytics system detecting fraudulent GST invoice patterns across 50,000+ records
- 3-layer detection pipeline — Rule-based validation → Statistical anomaly detection → Weighted risk scoring
- SQL window functions for Z-score analysis, rolling average spike detection, and IQR outlier detection
- Flags 7,512 invoices (15%) as suspicious and identifies 35 of 210 vendors as HIGH-risk
- Interactive Tableau dashboard — 🔗 View Live
- Stack: Python · PostgreSQL · SQL · Pandas · Tableau
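
The statistical layer described above can be sketched roughly as follows. This is a minimal illustration and not the project's actual queries: it uses Python's built-in `sqlite3` in place of PostgreSQL, and the table layout, toy invoices, thresholds, and weights are all assumptions made for the example. It covers the Z-score and rolling-average spike checks plus a simple weighted score; the IQR check is left out for brevity.

```python
import math
import sqlite3

# Hypothetical toy data: (invoice_id, vendor_id, amount). Not the project's data.
ROWS = [
    (1, "V1", 100.0), (2, "V1", 110.0), (3, "V1", 90.0),
    (4, "V1", 105.0), (5, "V1", 95.0), (6, "V1", 900.0),
    (7, "V2", 50.0), (8, "V2", 55.0), (9, "V2", 45.0),
    (10, "V2", 52.0), (11, "V2", 48.0), (12, "V2", 50.0),
]

# Assumed thresholds and weights, chosen only for this illustration.
Z_THRESHOLD = 2.0
SPIKE_FACTOR = 3.0
WEIGHTS = {"zscore": 60, "spike": 40}

# Window functions give each invoice its vendor-level mean, mean of squares,
# and the average of the previous three invoices from the same vendor.
QUERY = """
SELECT
    invoice_id,
    amount,
    AVG(amount) OVER (PARTITION BY vendor_id) AS mean_amt,
    AVG(amount * amount) OVER (PARTITION BY vendor_id) AS mean_sq,
    AVG(amount) OVER (
        PARTITION BY vendor_id ORDER BY invoice_id
        ROWS BETWEEN 3 PRECEDING AND 1 PRECEDING
    ) AS prev_avg
FROM invoices
ORDER BY invoice_id
"""


def score_invoices(rows):
    """Return {invoice_id: risk_score} using Z-score and spike flags."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE invoices (invoice_id INTEGER, vendor_id TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO invoices VALUES (?, ?, ?)", rows)

    scores = {}
    for invoice_id, amount, mean_amt, mean_sq, prev_avg in conn.execute(QUERY):
        # Population variance from E[x^2] - E[x]^2, clamped against rounding.
        variance = max(mean_sq - mean_amt ** 2, 0.0)
        std = math.sqrt(variance)
        z = (amount - mean_amt) / std if std > 0 else 0.0

        z_flag = abs(z) > Z_THRESHOLD
        # prev_avg is NULL (None) for a vendor's first invoice.
        spike_flag = prev_avg is not None and amount > SPIKE_FACTOR * prev_avg

        scores[invoice_id] = (
            WEIGHTS["zscore"] * z_flag + WEIGHTS["spike"] * spike_flag
        )
    conn.close()
    return scores


scores = score_invoices(ROWS)
flagged = {inv: s for inv, s in scores.items() if s > 0}
print(flagged)
```

In PostgreSQL the standard deviation could stay inside the query via `STDDEV_POP(...) OVER (PARTITION BY ...)`; the square root is taken in Python here only because SQLite does not ship that aggregate by default.
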
End-to-end credit risk analysis on 307,511 real loan applicants — identifying default patterns across demographics, bureau activity, and credit behavior
- Full pipeline: Python EDA + feature engineering → PostgreSQL business analysis → Tableau dashboard
- Male applicants default at a 45% higher rate than female applicants; under-30s are the highest-risk segment at 11.47%
- Bureau activity is the strongest default predictor: applicants with 41+ records show a 1.7x higher default rate than low-activity customers
- 4 SQL analysis layers: KPI generation, demographic analysis, credit behavior, advanced risk scoring
- Stack: Python · PostgreSQL · SQL · Tableau | Live Dashboard →
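
Comparisons such as "45% higher rate" or "1.7x higher" come from dividing one segment's default rate by another's. The sketch below shows that arithmetic on made-up records; the segment labels, counts, and function names are hypothetical and are not taken from the loan dataset.

```python
from collections import defaultdict

# Hypothetical records: (segment, defaulted) where defaulted is 1 or 0.
applicants = (
    [("M", 1)] * 3 + [("M", 0)] * 17
    + [("F", 1)] * 2 + [("F", 0)] * 18
)


def default_rates(records):
    """Return {segment: share of records in that segment that defaulted}."""
    totals = defaultdict(int)
    defaults = defaultdict(int)
    for segment, defaulted in records:
        totals[segment] += 1
        defaults[segment] += defaulted
    return {segment: defaults[segment] / totals[segment] for segment in totals}


def relative_lift(rate_a, rate_b):
    """How much higher rate_a is than rate_b, as a fraction of rate_b."""
    return rate_a / rate_b - 1


rates = default_rates(applicants)
lift = relative_lift(rates["M"], rates["F"])
print(f"M: {rates['M']:.2%}, F: {rates['F']:.2%}, lift: {lift:.0%}")
```

A lift of 0.5 reads as "50% higher", and the plain ratio `rate_a / rate_b` is what an "N times higher" statement refers to.
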
Automated data pipeline simulating real-time railway delays with risk classification
- APScheduler runs the pipeline every 5 minutes automatically
- Classifies delays into HIGH / MEDIUM / LOW risk tiers per train
- Dual storage — rolling CSV (last 10 records/train) + optional PostgreSQL
- Interactive Power BI dashboard with delay trends and risk distribution
- Stack: Python · Pandas · APScheduler · PostgreSQL · SQLAlchemy · Power BI
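
One way the risk tiers and the rolling per-train storage could work is sketched below using only the standard library. The delay thresholds, column names, and train ID are assumptions for illustration; only the limit of 10 records per train comes from the description above.

```python
import csv
import io
from collections import defaultdict, deque

MAX_RECORDS_PER_TRAIN = 10  # rolling window size from the description

# Hypothetical thresholds in minutes; the project's real cut-offs may differ.
HIGH_DELAY_MIN = 30
MEDIUM_DELAY_MIN = 10


def classify_delay(delay_minutes):
    """Map a delay in minutes to a HIGH / MEDIUM / LOW risk tier."""
    if delay_minutes >= HIGH_DELAY_MIN:
        return "HIGH"
    if delay_minutes >= MEDIUM_DELAY_MIN:
        return "MEDIUM"
    return "LOW"


def record_delay(history, train_id, delay_minutes):
    """Append a reading; the deque silently drops the oldest beyond the limit."""
    history[train_id].append((delay_minutes, classify_delay(delay_minutes)))


def to_csv(history):
    """Serialize the rolling history to CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["train_id", "delay_minutes", "risk"])
    for train_id, readings in history.items():
        for delay_minutes, risk in readings:
            writer.writerow([train_id, delay_minutes, risk])
    return buffer.getvalue()


# Each train gets its own bounded deque.
history = defaultdict(lambda: deque(maxlen=MAX_RECORDS_PER_TRAIN))

# Simulate 12 pipeline runs for one hypothetical train: 0, 5, ..., 55 minutes.
for delay in range(0, 60, 5):
    record_delay(history, "T100", delay)

csv_text = to_csv(history)
print(csv_text)
```

With APScheduler, a function that performs one such run would typically be registered on a scheduler with `add_job(func, "interval", minutes=5)`; that wiring is omitted here so the example runs without third-party packages.
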
⭐️ From Saksham3124