Course Outline

Introduction to Impala

  • What is Impala?
  • How Impala Differs from Relational Databases
  • Limitations and Future Directions
  • Using the Impala Shell
  • The Impala Daemon, Statestore, and Catalog Service

Loading Impala

  • Explore a New Impala Instance
  • Load CSV Data from Local Files
  • Point an Impala Table at Existing Data Files

Analyzing Data with Impala

  • Describe the Impala Table
  • Basic Syntax and Querying
  • Data Types
  • Filtering, Sorting, and Limiting Results
  • Joining and Grouping Data
  • Data Loading and Querying Examples
  • Improving Impala Performance
  • How Impala Works with Hadoop File Formats
  • Hands-On Exercise: Interactive Analysis with Impala

Programming Impala Applications

  • Overview of the Impala SQL Dialect
  • Overview of Impala Programming Interfaces

Troubleshooting Impala

  • Troubleshooting Impala SQL Syntax Issues
  • Troubleshooting I/O Capacity Problems
  • Impala Web User Interface for Debugging

Requirements

  • Knowledge of SQL

Duration

  21 hours

Related Courses

Hadoop Administration

  21 hours

Administrator Training for Apache Hadoop

  35 hours

Hadoop Administration on MapR

  28 hours

Hadoop for Developers (4 days)

  28 hours

Advanced Hadoop for Developers

  21 hours

HBase for Developers

  21 hours

Hadoop for Administrators

  21 hours

Hadoop for Business Analysts

  21 hours

Hadoop for Developers and Administrators

  21 hours

Apache Avro: Data Serialization for Distributed Applications

  14 hours

Apache Hadoop: Manipulation and Transformation of Data Performance

  21 hours

Hadoop for Project Managers

  14 hours

Samza for Stream Processing

  14 hours

Alluxio: Unifying Disparate Storage Systems

  7 hours

Tigon: Real-time Streaming for the Real World

  14 hours