Explore
- #data-engineering 11
- #gcp 6
- #bigquery 5
- #devops 3
- #streaming 3
- #architecture 2
- #grafana 2
- #monitoring 2
- #eks 2
- #kubernetes 2
- #python 2
- #pubsub 2
- #kafka 2
- #dbt 2
- #airflow 2
- #sql 1
- #biglake 1
- #lakehouse 1
- #observability 1
- #opentelemetry 1
- #cicd 1
- #github-actions 1
- #aws 1
- #gke 1
- #fastapi 1
- #api 1
- #dataflow 1
- #analytics-engineering 1
- #data-modeling 1
- #dataproc 1
- #spark 1
- #elt 1
- #cloud-composer 1
- #orchestration 1
- #docker 1
- #cost-optimization 1
- #cloud-storage 1
- #bigtable 1
- #spanner 1
- #snowflake 1
- #postgresql 1
2026
Apache Airflow on GCP - Patterns for Production DAGs
Production-ready patterns for Cloud Composer including DAG design, error handling, secrets management, and monitoring strategies.
Docker for Data Engineers — Containerizing Python Pipelines
Build reproducible data pipelines with Docker. Covers multi-stage builds, dependency management, and patterns for PySpark and Airflow containers.
BigQuery Cost Optimization - 5 Patterns Every Data Engineer Should Know
Reduce BigQuery costs with partitioning, clustering, materialized views, and query optimization techniques that actually work in production.
Understanding GCP's Data Storage Spectrum - When to Use What
A practical guide to choosing between Cloud Storage, BigQuery, Bigtable, and Spanner based on your data access patterns and scale requirements.
Real-Time Banking CDC Pipeline
Captures banking transaction changes in real-time using CDC, transforming operational data into analytics-ready models for business intelligence.