sp-202/README.md

Hi 👋, I'm Subhodeep Pal

Data Engineer | Platform Architect | Open Source Enthusiast



🚀 About Me

I am a Data Engineer focused on building robust, scalable Lakehouse Architectures. I specialize in deploying distributed compute engines on Kubernetes, optimizing Spark workloads, and automating complex data pipelines.

  • 🔭 Current Project: Building a production-grade Data Lakehouse on Kubernetes featuring:
    • Compute: Apache Spark 4.0.1 (Spark Connect) with custom Docker images.
    • Storage: Delta Lake 4.0 on MinIO (S3) with Unity Catalog OSS.
    • Serving: StarRocks for sub-second BI query performance.
    • Orchestration: Apache Airflow with complex DAG dependencies.
  • 🌱 Currently Learning: Advanced Observability (Loki/Promtail) and Bare Metal K8s networking.
  • 👯 Looking to Collaborate: Open source projects related to Data Engineering, Spark, or Cloud Infrastructure.
  • 💬 Ask Me About:
    • Apache Spark (Optimization, K8s Deployment)
    • Kubernetes (Operators, Helm, Networking)
    • Delta Lake & Data Strategy
    • CI/CD for Data

🛠️ The Tech Stack

Big Data & Distributed Systems

Infrastructure & DevOps

Languages


📈 GitHub Stats

Subhodeep's GitHub Stats
GitHub Streak

"I automate everything I can, and I optimize what I can't."

Pinned

  1. airflow-dags Public

    All Airflow DAGs

    Python

  2. apache/superset Public

    Apache Superset is a Data Visualization and Data Exploration Platform

    TypeScript · 73k stars · 17.4k forks

  3. spark-delta-lake Public

    Shell