Skip to content
View yashj1301's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Block or report yashj1301

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
yashj1301/README.md

Hey ๐Ÿ‘‹, I'm Yash Jain!

linkedin gravatar

๐Ÿš€ What I Do

I build data pipelines and analytics systems that turn messy operational data into business-ready insights.

Recently: managed the data infrastructure for a โ‚น5Cr+ monthly revenue VISA filing business โ€” automated 85% of manual reporting, reduced reconciliation errors from 12% to 2%, and helped recover โ‚น20L in overdue credit through data-driven AR management.

Currently: Working on scalable ETL pipelines using Python, SQL, BigQuery, and AWS to process large datasets and create executive dashboards that actually get used in board presentations.


๐Ÿ”จ Tech Stack

Languages & Databases:
Python SQL

Cloud & Data Engineering:
GCP BigQuery AWS AWS S3 AWS Redshift Docker

BI & Visualization:
Power BI Looker

Tools & Frameworks:
Pandas NumPy Git Selenium


๐Ÿ“Š Featured Projects

Built a star-schema data warehouse in AWS Redshift analyzing 5,000+ player performance records. Designed automated ETL pipeline using Python and AWS Glue, created Power BI dashboards for role-based performance comparison targeting auction teams and coaches.

Tech: Python, AWS (S3, Glue, Redshift), Power BI, ETL pipelines


Developed 60% faster asynchronous web scraping + ETL pipeline processing 30,000+ customer reviews. Built BigQuery star-schema warehouse with sentiment analysis to create actionable dashboards for CX teams.

Tech: Python, AWS Glue, BigQuery, NLP, Async/Await, Power BI


๐Ÿ’ผ What I'm Working On

๐Ÿ”ญ Building: Scalable data pipelines for multi-source integration
๐ŸŒฑ Learning: Containerization (Docker), CI/CD for DataOps, and advanced cloud orchestration
๐Ÿ“ˆ Goal: Transition into Data Engineering roles at product companies


๐ŸŽฏ Open to Opportunities

Looking for Data Analyst, BI Analyst, or Junior Data Engineer roles where I can:

  • Build scalable data systems from scratch
  • Collaborate with product and tech teams
  • Turn complex data into actionable business insights

๐Ÿ“ง Let's connect: yash.jain106@gmail.com | LinkedIn


๐Ÿ“š Beyond Code

When I'm not building pipelines, you'll find me:

  • ๐Ÿ“– Reading about theoretical cosmology and self-improvement
  • ๐Ÿฅ‹ Practicing Kyudo and Jiu-Jitsu
  • ๐ŸŽฎ Gaming (strategy and RPGs)
  • ๐Ÿ”ญ Exploring scientific research papers on arXiv

๐Ÿ“Š GitHub Stats

Profile views


โญ๏ธ If you find my projects interesting, feel free to star them or reach out for collaboration!

Pinned Loading

  1. airlines-reviews-analysis airlines-reviews-analysis Public

    This project focuses on analyzing airlines' customer reviews from Skytrax website to perform review analysis to understand customer experiences.

    Jupyter Notebook

  2. cricketer-stats cricketer-stats Public

    This repository is an end-to-end project to extract cricketer statistics from the ESPN CricInfo website. The rights to the data belong to ESPN CricInfo.

    Jupyter Notebook

  3. lead-scoring lead-scoring Public

    This project is a Lead Scoring Case Study, built as part of the UpGrad Data Science course, to help businesses identify high-converting leads using Logistic Regression Machine Learning model.

    Jupyter Notebook