Course Outline

Introduction

Installing and Starting Apache Druid

Druid Architecture and Design

Real-Time Ingestion of Event Data

Sharding and Indexing

Loading Data

Querying Data

Visualizing Data

Running a Distributed Cluster

Druid + Apache Hive

Druid + Apache Kafka

Druid + Others

Troubleshooting

Administrative Tasks

Summary and Conclusion

Requirements

  • A basic understanding of data infrastructure.
  • A general knowledge of distributed systems.
  • Basic Linux command line familiarity.

Audience

  • Application developers
  • Software engineers
  • Technical consultants
  • DevOps professionals
  • Architecture engineers
  21 Hours
 

Testimonials

Related Courses

Advanced Data Analysis with TIBCO Spotfire

 14 hours

TIBCO Spotfire Analyst is a web and Windows based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire

Introduction to Spotfire

 14 hours

TIBCO Spotfire Analyst is a web and Windows based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts and data analysts who wish to learn

AI-Driven Data Analysis with TIBCO Spotfire X

 14 hours

TIBCO Spotfire X is a business intelligence and augmented analytics tool that combines natural language, artificial intelligence (AI), and streaming data technologies. This instructor-led, live training (online or onsite) is aimed at business

Data Analysis with SQL, Python and Spotfire

 14 hours

In this instructor-led, live training, participants will learn three different approaches for accessing, analyzing and visualizing data. We start with an introduction to RDMS databases; the focus will be on accessing and querying an Oracle database

Amazon Redshift

 21 hours

Amazon Redshift is a petabyte-scale cloud-based data warehouse service in AWS. In this instructor-led, live training, participants will learn the fundamentals of Amazon Redshift. By the end of this training, participants will be able

Big Data & Database Systems Fundamentals

 14 hours

The course is part of the Data Scientist skill set (Domain: Data and Technology).

MemSQL

 28 hours

MemSQL is an in-memory, distributed, SQL database management system for cloud and on-premises. It's a real-time data warehouse that immediately delivers insights from live and historical data. In this instructor-led, live training,

Data Vault: Building a Scalable Data Warehouse

 28 hours

Data Vault Modeling is a database modeling technique that provides long-term historical storage of data that originates from multiple sources. A data vault stores a single version of the facts, or "all the data, all the time". Its

ELK: Elasticsearch, Logstash and Kibana for Administrators

 14 hours

This instructor-led, live training is aimed at system administrators who wish to set up an ELK stack (Elasticsearch, Logstash, Kibana). Note that a minimum of 3 delegates is needed for this course to run. The training starts with a discussion of

Monitoring with Grafana

 14 hours

Grafana is a visualization and metric analytic suite. Grafana presents a visual representation of real-time data for application analytics. Grafana is most commonly used for home automation, process control, and more. The aim of the course is to

Grafana and Graphite

 14 hours

Grafana is an open source visualization and metric analytic suite for visualizing real-time data of application analytics. Graphite is an open source software for monitoring and graphing computer system data. Together with Grafana and Gaphite, users

Kibana: Essentials

 14 hours

This training introduces Kibana to the users of Elasticsearch. Kibana is an open source analytics and visualization platform designed to work with Elasticsearch. You use Kibana to search, view, and interact with data stored in Elasticsearch

Apache Kylin: From Classic OLAP to Real-Time Data Warehouse

 14 hours

Apache Kylin is an extreme, distributed analytics engine for big data. In this instructor-led live training, participants will learn how to use Apache Kylin to set up a real-time data warehouse. By the end of this training, participants will

TIBCO for Developers

 21 hours

TIBCO is a software suite for integrating, managing, and monitoring enterprise applications and information delivery. Two of its products, BusinessEvents (business rules engine) and ActiveMatrix Service Bus (lightweight enterprise service bus), are

TIBCO Statistica

 14 hours

TIBCO Statistica is an advanced analytics system that provides data analysis, data management, data visualization, data mining, statistics, and machine learning solutions for businesses. This instructor-led, live training (online or onsite) is