Home¶
Hari Prasanna
Data Engineer — End-to-end data lifecycle specialist with a learn-by-doing philosophy.
🛠️ Tech Stack
Python SQL Bash Apps Script Databricks Delta Lake dbt Pandas PySpark Airflow Docker AWS S3 & IAM PostgreSQL Oracle Looker Power BI Grafana GitHub Actions Git Ubuntu
💼 Work Projects — Production Systems
Oracle → Google Sheets ETL Pipeline
ProductionAutomated a 100 min/day manual report to fully autonomous — powers the DG Monitor Dashboard
Real-Time KPI TV Dashboard
ProductionReplaced a €10K vendor proposal with an in-house solution running at under €70/month
🚀 Personal Projects
Shopstream: Clickstream Lakehouse
In progressE-commerce conversion funnel analytics on a Medallion Architecture with Delta Lake
TMDB: Local to Cloud Lakehouse
CompletedMigrated a local Postgres pipeline to a cloud lakehouse on AWS + Databricks
Zalando LUU Returns Pipeline
CompletedMock ELT with injected anomalies, modeled into a Star Schema via dbt
Airflow Orchestration
InfraCentralized DAG management with "Clean Room" env isolation
CI/CD Pipelines
ActiveGitHub Actions workflows with path filters for multi-project repo
Weather ELT Pipeline
CompletedFully autonomous local pipeline — zero-touch Cron automation for 10 German cities
🎓 Certifications
Databricks Certified Data Engineer Associate
Databricks
mkdocs-material — each project gets its own nested section with sub-pages for architecture diagrams, code walkthroughs, and lessons learned.