Course Outline

To request a customized course outline for this training, please contact us.

Requirements

  • Linux experience
  • Knowledge of database concepts and practices
  • Knowledge of Hadoop infrastructure and practices
  21 Hours
 

Testimonials

Related Courses

Alluxio: Unifying Disparate Storage Systems

 7 hours

Alluxio is an open-source virtual distributed storage system that unifies disparate storage systems and enables applications to interact with data at memory speed. It is used by companies such as Intel, Baidu and Alibaba. In this instructor-led,

Administrator Training for Apache Hadoop

 35 hours

Audience: The course is intended for IT specialists looking for a solution to store and process large data sets in a distributed system environment Goal: Deep knowledge on Hadoop cluster

Apache Hadoop: Manipulation and Transformation of Data Performance

 21 hours

This course is intended for developers, architects, data scientists or any profile that requires access to data either intensively or on a regular basis. The major focus of the course is data manipulation and transformation. Among the tools

Hadoop Administration

 21 hours

The course is dedicated to IT specialists that are looking for a solution to store and process large data sets in distributed system environment Course goal: Getting knowledge regarding Hadoop cluster

Hadoop For Administrators

 21 hours

Apache Hadoop is the most popular framework for processing Big Data on clusters of servers. In this three (optionally, four) days course, attendees will learn about the business benefits and use cases for Hadoop and its ecosystem, how to plan

Hadoop for Business Analysts

 21 hours

Apache Hadoop is the most popular framework for processing Big Data. Hadoop provides rich and deep analytics capability, and it is making in-roads in to tradional BI analytics world. This course will introduce an analyst to the core components of

Hadoop for Developers (4 days)

 28 hours

Apache Hadoop is the most popular framework for processing Big Data on clusters of servers. This course will introduce a developer to various components (HDFS, MapReduce, Pig, Hive and HBase) Hadoop

Advanced Hadoop for Developers

 21 hours

Apache Hadoop is one of the most popular frameworks for processing Big Data on clusters of servers. This course delves into data management in HDFS, advanced Pig, Hive, and HBase.  These advanced programming techniques will be beneficial to

Hadoop for Developers and Administrators

 21 hours

Hadoop is the most popular Big Data processing framework.

Hadoop for Project Managers

 14 hours

As more and more software and IT projects migrate from local processing and data management to distributed processing and big data storage, Project Managers are finding the need to upgrade their knowledge and skills to grasp the concepts and

Hadoop Administration on MapR

 28 hours

Audience: This course is intended to demystify big data/hadoop technology and to show it is not difficult to understand.

HBase for Developers

 21 hours

This course introduces HBase – a NoSQL store on top of Hadoop.  The course is intended for developers who will be using HBase to develop applications,  and administrators who will manage HBase clusters. We will walk a developer

Apache Avro: Data Serialization for Distributed Applications

 14 hours

Audience Developers Format of the Course Lectures, hands-on practice, small tests along the way to gauge understanding

Samza for Stream Processing

 14 hours

Apache Samza is an open-source near-realtime, asynchronous computational framework for stream processing.  It uses Apache Kafka for messaging, and Apache Hadoop YARN for fault tolerance, processor isolation, security, and resource

Tigon: Real-time Streaming for the Real World

 14 hours

Tigon is an open-source, real-time, low-latency, high-throughput, native YARN, stream processing framework that sits on top of HDFS and HBase for persistence. Tigon applications address use cases such as network intrusion detection and analytics,