Big Data - Capstone Project

University of California San Diego

Capstone project integrating big data skills to build ecosystems and analyze simulated datasets using advanced tools and methodologies.

Unknown | 7 weeks | English | 18,853 enrolled

About this Course

Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods from the earlier courses in this specialization. You will analyze a data set simulating big data generated by a large number of users playing our imaginary game "Catch the Pink Flamingo". During the five-week Capstone Project, you will walk through the typical big data science steps of acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move on to more challenging big data problems that require the more advanced tools you have learned, including KNIME, Spark's MLlib, and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focused on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.

What You'll Learn

  • Build an integrated big data ecosystem using advanced tools
  • Apply methodologies learned in prior courses
  • Analyze simulated user game data for big data concepts

Prerequisites

  • Basic computer and internet skills
  • Ability to read English instructions and complete short practice activities

Instructors

Ilkay Altintas

Chief Data Science Officer

Amarnath Gupta

Director, Advanced Query Processing Lab

Topics

Data Analysis
Data Science
Apache Spark
Big Data
Network Analysis
Splunk
Unstructured Data
Classification Algorithms
Analytics
Data Wrangling

Course Info

Platform: Coursera
Level: Unknown
Pacing: Unknown
Price: Free

Skills

Data Analysis
Data Science
Apache Spark
Big Data
Network Analysis
Splunk
Unstructured Data
Classification Algorithms
Analytics
Data Wrangling
