TrueschoTruescho
All Courses
Building Automated Data Pipelines with Spark, dbt, and Airflow
Coursera
Course
Unknown

Building Automated Data Pipelines with Spark, dbt, and Airflow

Coursera

Master building automated data pipelines processing millions of records using Apache Spark, dbt, and Airflow with robust workflows and performance optimization.

Unknown11 weeksEnglish

About this Course

You'll master the art of building production-ready data pipelines that automatically process millions of records. In this hands-on course, you'll design end-to-end workflows that integrate diverse data sources—from databases and APIs to real-time streams—using industry-standard tools like Apache Spark, dbt, and Apache Airflow. You'll learn to create robust data models that preserve historical changes, implement performance optimizations that reduce processing time by 30% or more, and build automated workflows with intelligent retry logic and monitoring alerts. By the end, you'll have created a complete data pipeline system that demonstrates the technical skills data engineering teams need most. You'll know how to unify fragmented data sources, apply advanced transformation techniques, and ensure your pipelines run reliably at scale. This practical experience directly translates to the challenges you'll face as a data engineer, data analyst, or anyone working with large-scale data systems in modern organizations

What You'll Learn

  • Build end-to-end data pipelines ingesting from databases, APIs, and streams
  • Design data models with complete historical tracking using SCD Type 2
  • Create automated workflows with intelligent retry logic and SLA monitoring
  • Optimize Spark jobs using partitioning and caching for better performance

Prerequisites

  • Basic computer and internet skills
  • Ability to read instructions in English and complete short practice activities

Instructors

P

Professionals from the Industry

Topics

Data Analysis
Data Science
Probability and Statistics
Data Integration
Data Pipelines
Data Quality
Configuration Management
Data Warehousing
Apache Airflow
Data Architecture

Course Info

PlatformCoursera
LevelUnknown
PacingUnknown
PriceFree

Skills

تحليل البيانات
علوم البيانات
الاحتمالات والإحصاء
دمج البيانات
خطوط أنابيب البيانات
جودة البيانات
إدارة التكوين
تخزين البيانات
Apache Airflow
Data Architecture

Start Learning Now