Course Outline

Introduction to Hortonworks Data Platform (HDP)

Overview of Big Data and Apache Hadoop

Installing and Configuring HDP

Setting Up, Deploying, and Managing a Hadoop Cluster

Understanding and Configuring YARN and MapReduce

Overview of Job Scheduling

Ensuring Data Integrity

Understanding Enterprise Data Movement

Using HDFS Commands & Services

Transferring Data Using Flume

Working with Hive

Scheduling Workflow Using Oozie

Exploring Hadoop 2.x

Understanding HBase Architecture

Monitoring HDP2 Services Using Ambari

New Features in HDP

Troubleshooting

Summary and Conclusion

Requirements

  • An understanding of Hadoop and big data.
  • An understanding of Spark.
  • Familiarity with the command line.
  • System administration experience.

Audience

  • Hadoop administrators

Duration

  • 21 hours

Related Courses

Python and Spark for Big Data (PySpark)

  21 hours

Introduction to Graph Computing

  28 hours

Apache Spark MLlib

  35 hours

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

  21 hours

Spark for Developers

  21 hours

Magellan: Geospatial Analytics on Spark

  14 hours

Alluxio: Unifying Disparate Storage Systems

  7 hours

Apache Spark SQL

  7 hours

A Practical Introduction to Stream Processing

  21 hours

Big Data Analytics in Health

  21 hours

Apache Spark in the Cloud

  21 hours

Apache Spark Streaming with Scala

  21 hours

SMACK Stack for Data Science

  14 hours

Apache Spark Fundamentals

  21 hours

Apache Spark for .NET Developers

  21 hours