Projects
Case studies on data engineering and cloud infrastructure work.
AWS EMR Spark Pipeline
Distributed data processing on AWS EMR with PySpark for large-scale financial data analysis. Features S3 integration and scalable cluster configuration.
Portfolio Website
Personal portfolio built with Astro.js, React, and Tailwind CSS. Achieves a perfect 100 Lighthouse performance score with optimized Core Web Vitals.
ELT Pipeline with dbt + Snowflake
Production ELT pipeline on Snowflake with dbt transformations, medallion architecture, dimensional modeling, Airflow orchestration, and CI/CD workflows.
PySpark SQL Tutorial
Educational project demonstrating PySpark DataFrame operations, SQL queries, window functions, and distributed data processing patterns.
Production Data Platform
Built a production data platform at a genomics startup: multi-source ingestion framework, Kubernetes infrastructure, data governance, and ML-ready pipelines processing 20+ TB of data.
AI Travel Recommendation App
Serverless application using AWS Bedrock and Claude for personalized travel recommendations. React frontend with Cognito authentication.
Full-Stack SaaS Application
Multi-tenant web application with Next.js, Supabase, and Stripe. Features AI document processing, authentication, and subscription management.