Course Outline

Introduction to Apache Drill

How does Apache Drill compare to Spark SQL, Hive and Impala?

Overview of Apache Drill Features and Architecture

  • Apache Drill Components

Understanding Apache Drill Queries

  • Query Execution Process

Performing SQL Queries

  • Connecting to the data source
  • Querying the data

Using the Drill Web Console

  • Query, Profiles, Storage, Metrics, Threads, and Options

Performance Optimization Strategy

  • Identifying the source of performance issues
  • Analyzing Query Plans and Profiles

Apache Drill Query Optimization

  • Optimizing a Query

Limiting the Data that Drill Reads

  • Partitioning the data (partition pruning)

Apache Drill Logging and Debugging

  • Analyzing Drill Error Messages
  • Configuring Log File Options

Troubleshooting Apache Drill

Summary and Conclusion

Requirements

  • A general understanding of Hadoop
  • Experience with Linux command line
  7 Hours
 

Testimonials

Related Courses

Apache Drill

 21 hours

Apache Drill is a schema-free, distributed, in-memory columnar SQL query engine for Hadoop, NoSQL and other Cloud and file storage systems. The power of Apache Drill lies in its ability to join data from multiple data stores using a single query.

Apache Drill Query Optimization

 7 hours

Apache Drill is a schema-free, distributed, in-memory columnar SQL query engine for Hadoop, NoSQL and other Cloud and file storage systems. The power of Apache Drill lies in its ability to join data from multiple data stores using a single query.

Big Data Analytics for Telecom Regulators

 14 hours

To meet compliance of the regulators, CSPs (Communication service providers) can tap into Big Data Analytics which not only help them to meet compliance but within the scope of same project they can increase customer satisfaction and thus

Big Data Architect

 35 hours

Day 1 - provides a high-level overview of essential Big Data topic areas. The module is divided into a series of sections, each of which is accompanied by a hands-on exercise. Day 2 - explores a range of topics that relate analysis practices and

Big Data and its Management Process

 14 hours

Objective : This training course aims at helping attendees understand why Big Data is changing our lives and how it is altering the way businesses see us as consumers. Indeed, users of big data in businesses find that big data unleashes a

Big Data - Data Science

 14 hours

This classroom based training session will explore Big Data. Delegates will have computer based examples and case study exercises to undertake with relevant big data tools

Vespa: Serving Large-Scale Data in Real-Time

 14 hours

Vespa is an open-source big data processing and serving engine created by Yahoo.  It is used to respond to user queries, make recommendations, and provide personalized content and advertisements in real-time. This instructor-led, live

Big Data Business Intelligence for Govt. Agencies

 35 hours

Advances in technologies and the increasing amount of information are transforming how business is conducted in many industries, including government. Government data generation and digital archiving rates are on the rise due to the rapid growth of

Big Data Business Intelligence for Criminal Intelligence Analysis

 35 hours

Advances in technologies and the increasing amount of information are transforming how law enforcement is conducted. The challenges that Big Data pose are nearly as daunting as Big Data's promise. Storing data efficiently is one of these

From Data to Decision with Big Data and Predictive Analytics

 21 hours

Audience If you try to make sense out of the data you have access to or want to analyse unstructured data available on the net (like Twitter, Linked in, etc...) this course is for you. It is mostly aimed at decision makers and people who need to

Programming with Big Data in R

 21 hours

Big Data is a term that refers to solutions destined for storing and processing large data sets. Developed by Google initially, these Big Data solutions have evolved and inspired other similar projects, many of which are available as open-source. R

Big Data Storage Solution - NoSQL

 14 hours

When traditional storage technologies don't handle the amount of data you need to store there are hundereds of alternatives. This course try to guide the participants what are alternatives for storing and analyzing Big Data and what are theirs

A Practical Introduction to Data Analysis and Big Data

 35 hours

Participants who complete this instructor-led, live training will gain a practical, real-world understanding of Big Data and its related technologies, methodologies and tools. Participants will have the opportunity to put this knowledge into

Big Data & Database Systems Fundamentals

 14 hours

The course is part of the Data Scientist skill set (Domain: Data and Technology).

Data Science for Big Data Analytics

 35 hours

Big data is data sets that are so voluminous and complex that traditional data processing application software are inadequate to deal with them. Big data challenges include capturing data, data storage, data analysis, search, sharing, transfer,