Reporium
GraphWikiTaxonomyStacksInsightsTrendsArchitectureAI-NativeFAQ
Ask anything about the repo library…
Loading repo…
←Library/data-engineering-zoomcamp
Library/data-engineering-zoomcampForked

DataTalksClub/data-engineering-zoomcamp

data-engineering-zoomcamp

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

View on GitHub↗Upstream DataTalksClub/data-engineering-zoomcamp↗

Builder

DataTalksClub

DataTalksClub

DataTalksClub • individual

Stars

41,636

Using upstream star count

Forks

8,279

Using upstream fork count

Open Issues

0

Activity Score

0/100

0 commits in 30d

Created

Oct 21, 2021

Project creation date

README Summary

<p align="center"> <img width="100%" src="/images/architecture/arch_v5_workshops.png" alt="Data Engineering Zoomcamp Overview"> </p>

Community Evaluation

Loading…

AI Dev Skills

Unmapped

Batch ProcessingCloud Data EngineeringCloud Data PlatformsData InfrastructureData ModelingData OrchestrationData Pipeline ArchitectureData Pipeline DesignData Quality and TestingData Quality and ValidationData WarehousingDocker ContainerizationETL/ELT DesignETL/ELT Design PatternsETL/ELT WorkflowsInfrastructure as CodeProduction Data SystemsSQL and Database DesignSQL OptimizationStream ProcessingWorkflow AutomationWorkflow Orchestration

Tags

Batch ProcessingCloud Data EngineeringCloud Data PlatformsData InfrastructureData ModelingData OrchestrationData Pipeline ArchitectureData Pipeline DesignData Quality and TestingData Quality and ValidationData WarehousingDocker ContainerizationETL/ELT DesignETL/ELT Design PatternsETL/ELT WorkflowsInfrastructure as CodeProduction Data SystemsSQL and Database DesignSQL OptimizationStream ProcessingWorkflow AutomationWorkflow OrchestrationAutomationCourseData EngineeringDatabaseDockerForkedGoogle CloudMachine LearningPythonReal-Time / StreamingSparkTutorial

Taxonomy

category

Dev Tools & AutomationInference & ServingMLOps & InfrastructureCloud & PlatformsLearning ResourcesData Science & Analytics

Deployment Context

CloudOn-premiseSelf-hosted

Industries

EducationDeveloper ToolsAnalyticsData-driven Organizations

Modalities

TabularCodeStructured Data

Skill Areas

Data Pipeline DesignETL/ELT WorkflowsData WarehousingData OrchestrationCloud Data PlatformsSQL OptimizationData ModelingStream ProcessingBatch ProcessingData Quality and TestingInfrastructure as CodeDocker ContainerizationWorkflow AutomationETL/ELT Design PatternsSQL and Database DesignData Quality and ValidationData Pipeline ArchitectureETL/ELT DesignWorkflow OrchestrationCloud Data EngineeringData InfrastructureProduction Data Systems

tag

AutomationCourseData EngineeringDatabaseDockerForkedGoogle CloudMachine LearningPythonReal-Time / StreamingSparkTutorial

Use Cases

Building production data pipelinesData warehouse design and implementationETL pipeline developmentData orchestration and schedulingCloud data platform setup and managementReal-world data engineering project implementationBuilding ETL pipelinesReal-time data processingData pipeline orchestrationProduction data system architectureLearning data pipeline developmentBuilding ETL workflowsData warehouse designStream processing implementationProduction data system deployment

Recent Activity

Updated 2 months ago

7 Days

0

30 Days

0

90 Days

9

Document Bruin MCP integration steps for VS Code (#832)

motho17 • Mar 19, 2026

ef44b88

Add files via upload (#833)

Khang Tran • Mar 19, 2026

f556028

Update streaming homework with verified answers and setup hints

Alexey Grigorev • Mar 12, 2026

16fdabf

Quality

production
Quality
high
Maturity
production

Categories

Dev Tools & AutomationPrimaryInference & ServingMLOps & InfrastructureCloud & PlatformsLearning ResourcesData Science & AnalyticsOther AI / ML

PM Skills

Scale & ReliabilityDeveloper Platform

Languages

Jupyter Notebook100.0%

Timeline

Project created
Oct 21, 2021
Forked
Mar 26, 2026
Your last push
2 months ago
Upstream last push
1 months ago
Tracked since
Mar 19, 2026

Similar Repos

pgvector cosine similarity · $0

Loading…