- 📫 How to reach me: laijupjoy01@gmail.com
- Big Data Engineering
- Machine Learning
- Deep Learning
- Computer Vision
- MLOps
- AWS
- Azure
- Embedded Systems & IoT
Building a complete data warehouse for an e-commerce application to perform Apache Hive analytics on various datasets using big data tools like Apache Sqoop, Apache Spark, and HDFS. In order to ach…
Python
The MySQL Kafka S3 Redshift Pipeline enables real-time data transfer from MySQL to Redshift. Leveraging Kafka for streaming, S3 for storage, and Redshift for analytics, it ensures efficient, scalab…
Python
Implement an AWS Glue ETL pipeline for data extraction, transformation, and loading into Amazon Redshift. Enhance data warehousing efficiency and analytics capabilities.
Python
This project shows how to design an ELT system using AWS EMR and Hive. The main objective is to keep code complexity and server management low while automating as much as possible.
HiveQL
Implements a credit card fraud detection system using big data technologies such as Hadoop, Spark, and Apache Kafka.
Scala
An AWS Snowflake data pipeline, powered by Kinesis and Airflow, for scalable, reliable, and automated data processing in the cloud.
Python