
In this course, you'll learn how to manage big datasets, how to load them into clusters and cloud storage, and how to apply structure to the data so that you can run queries on it using distributed SQL engines like Apache Hive and Apache Impala. Youâll learn how to choose the right data types, storage systems, and file formats based on which tools youâll use and what performance you need. By the end of the course, you will be able to ⢠use different tools to browse existing databases and tables
Ian Cook
Cloudera
Glynn Durham
Cloudera