TrueschoTruescho
All Courses
Big Data Computing with Spark
edX
Course
Intermediate
Free to Audit
Certificate

Big Data Computing with Spark

The Hong Kong University of Science and Technology

Learn the theory and gain hands-on experience of big data systems, using Spark as the exemplary platform.

8 hrs/week8 weeksEnglish3,414 enrolled
Free to Audit

About this Course

Big data systems such as Hadoop and Spark emerge as enabling technologies in managing massive amounts of data across hundreds or even thousands of computing nodes. Meanwhile, cloud computing platforms have made these technologies easily accessible to individuals as well as large enterprises. This course is an online adaptation of the signature course MSBD 5003 Big Data Computing offered to our popular MSc Program in Big Data Technology. In addition to 20+ hours of lecture videos, the course contains 100+ multiple-choice questions and 20 coding questions, aimed at equipping learners with both the theory and practical skills of big data systems, using Spark as the exemplary platform.

What You'll Learn

  • Spark programming using both RDD and DataFrame APIs
  • Useful packages including ML, GraphX/GraphFrames, and SparkStreaming
  • Spark internals and performance optimizations
  • Algorithm design for big data systems

Prerequisites

  • Basic Python programming

Instructors

K

Ke YI

Professor of Computer Science and Engineering, and Director of MSc Program in Big Data Technology

Topics

Apache Hadoop
Cloud Computing
Apache Spark
Nodes (Networking)
Big Data

Course Info

PlatformedX
LevelIntermediate
PacingUnknown
CertificateAvailable
PriceFree to Audit

Skills

Apache Hadoop
الحوسبة السحابية
Apache Spark
العُقد والشبكات
البيانات الضخمة

Start Learning Now