Skip to content
#

data-profiling

Here are 16 public repositories matching this topic...

Comprehensive data governance pipeline for SSH honeypot logs—covering data profiling, cleansing, quality assurance, encryption, classification, and GDPR/CCPA/HIPAA compliance. Built with Pandas, Pandera, YData Profiling, and cryptography, with simulated Caesar cipher attacks to demonstrate practical data-security techniques.

  • Updated Jun 23, 2025
  • HTML

The team explored persona‑driven behavioural analytics to address risky resource planning practices. By combining detailed persona definitions, behavioural metrics, and deep analysis of forecasting and utilisation data, they designed a dashboard concept that highlights over‑optimistic planning, generic resource use, and weak feedback loops,...

  • Updated Apr 14, 2026
  • HTML

Collection of APIs for Informatica Intelligent Cloud Services (IICS) and Intelligent Data Management Cloud (IDMC), providing programmatic access to data integration, data governance, data quality, master data management, B2B gateway, and platform administration capabilities.

  • Updated Jun 2, 2026
  • HTML

An end-to-end Business Intelligence (BI) pipeline designed to process and analyze 141 million IMDb records for deriving insights on movies, ratings, and global cinema trends. The project demonstrates large-scale data engineering, ELT automation, and dashboard-driven analytics.

  • Updated Feb 2, 2026
  • HTML

Improve this page

Add a description, image, and links to the data-profiling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-profiling topic, visit your repo's landing page and select "manage topics."

Learn more