TrueschoTruescho
All Courses
Spark, Hadoop, and Snowflake for Data Engineering
Coursera
Course
Unknown

Spark, Hadoop, and Snowflake for Data Engineering

Duke University

Acquire skills to build scalable data pipelines using Hadoop, Spark, and Snowflake, optimize them, and implement ML solutions on Databricks.

Unknown4 weeksEnglish

About this Course

e.g. This is primarily aimed at first- and second-year undergraduates interested in engineering or science, along with high school students and professionals with an interest in programmingGain the skills for building efficient and scalable data pipelines. Explore essential data engineering platforms (Hadoop, Spark, and Snowflake) as well as learn how to optimize and manage them. Delve into Databricks, a powerful platform for executing data analytics and machine learning tasks, while honing your

What You'll Learn

  • Create scalable data pipelines using Hadoop, Spark, Snowflake, and Databricks
  • Optimize data engineering performance via clustering and scaling
  • Build machine learning solutions with PySpark and MLFlow on Databricks
  • Implement DataOps and DevOps practices for continuous integration and deployment

Prerequisites

  • Basic programming and Python knowledge
  • Fundamental data concepts

Instructors

N

Noah Gift

Interdisciplinary Data Science (MIDS)

K

Kennedy Behrman

Envestnet

M

Matt Harrison

Topics

Python Programming
Data Quality
Apache Hadoop
Data Transformation
MLOps (Machine Learning Operations)
Data Integration
DevOps
Databricks
Data Warehousing
Distributed Computing

Course Info

PlatformCoursera
LevelUnknown
PacingUnknown
PriceFree

Skills

برمجة بايثون
جودة البيانات
أباتشي هادوب
تحويل البيانات
عمليات التعلم الآلي
دمج البيانات
عمليات التطوير والتشغيل
Databricks
Data Warehousing
Distributed Computing

Start Learning Now