An open-source framework for multi-cloud cost visibility. Extendable with dlt.
-
Updated
Nov 21, 2025 - Python
An open-source framework for multi-cloud cost visibility. Extendable with dlt.
Building a next-generation hybrid data pipeline architecture that combines the power of Microsoft Fabric, Azure Cloud, and Power BI. This pipeline is engineered to tackle the challenges of real-time data ingestion, multi-layered processing, and analytics, delivering business-critical insights.
Фабрика DAG
Realtime Data Streaming | End-to-End Data Engineering Project
Build a data pipeline on Google Cloud using an event-driven architecture, leveraging GCS, Cloud Run functions, and BigQuery. Explore both VM and Composer options for Airflow management, and utilize Logging & Monitoring for pipeline health. Discover how SQL-based BigQuery ML can be used for initial ML implementation in specific scenarios.
Инфраструктура для data engineer S3
infrastructure_for_data_engineer_kafka
Всё что нужно знать про DuckDB
Power BI dashboard analyzing client credit default patterns
A collection of real-world projects, code-alongs, and competitions completed on DataCamp.
Фабрика DAG через SCD-таблицу с конфигурациями
Leveraging AWS Cloud Services, an ETL pipeline transforms YouTube video statistics data. Data is downloaded from Kaggle, uploaded to an S3 bucket, and cataloged using AWS Glue for querying with Athena. AWS Lambda and Glue converts to Parquet format and stores it in a cleansed S3 bucket. AWS QuickSight then visualizes the materialised data.
Add a description, image, and links to the data-engineering-project topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering-project topic, visit your repo's landing page and select "manage topics."