I build data pipelines and analytics systems that turn messy operational data into business-ready insights.
Recently: managed the data infrastructure for a โน5Cr+ monthly revenue VISA filing business โ automated 85% of manual reporting, reduced reconciliation errors from 12% to 2%, and helped recover โน20L in overdue credit through data-driven AR management.
Currently: Working on scalable ETL pipelines using Python, SQL, BigQuery, and AWS to process large datasets and create executive dashboards that actually get used in board presentations.
Built a star-schema data warehouse in AWS Redshift analyzing 5,000+ player performance records. Designed automated ETL pipeline using Python and AWS Glue, created Power BI dashboards for role-based performance comparison targeting auction teams and coaches.
Tech: Python, AWS (S3, Glue, Redshift), Power BI, ETL pipelines
Developed 60% faster asynchronous web scraping + ETL pipeline processing 30,000+ customer reviews. Built BigQuery star-schema warehouse with sentiment analysis to create actionable dashboards for CX teams.
Tech: Python, AWS Glue, BigQuery, NLP, Async/Await, Power BI
๐ญ Building: Scalable data pipelines for multi-source integration
๐ฑ Learning: Containerization (Docker), CI/CD for DataOps, and advanced cloud orchestration
๐ Goal: Transition into Data Engineering roles at product companies
Looking for Data Analyst, BI Analyst, or Junior Data Engineer roles where I can:
- Build scalable data systems from scratch
- Collaborate with product and tech teams
- Turn complex data into actionable business insights
๐ง Let's connect: yash.jain106@gmail.com | LinkedIn
When I'm not building pipelines, you'll find me:
- ๐ Reading about theoretical cosmology and self-improvement
- ๐ฅ Practicing Kyudo and Jiu-Jitsu
- ๐ฎ Gaming (strategy and RPGs)
- ๐ญ Exploring scientific research papers on arXiv


