Course Outline

Introduction

Installing and Configuring Zeppelin

Overview of Zeppelin Features and Architecture

Navigating the Browser Interface

Understanding the Data Analysis Workflow

Organizing Data for Analysis

Visualizing Data

Sharing Data and Collaborating with Other Analysts

Working with Plug-ins

Backend Data Processing

Working with Scala and Apache Spark

Working with Python and Apache Spark

Working with Spark SQL

Working with JDBC

Using Markdown and Shell

Integrating Zeppelin with Spark, Flink and MapReduce

Setting up and Securing Multi-user Instances

Troubleshooting

Summary and Conclusion

Requirements

  • An understanding of big data concepts
  • Experience with Spark and Hadoop
  • Experience with the command line

Duration: 14 hours

Related Courses

Advanced Data Analysis with TIBCO Spotfire

 14 hours

TIBCO Spotfire Analyst is a web- and Windows-based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts who wish to learn advanced Spotfire

Introduction to Spotfire

 14 hours

TIBCO Spotfire Analyst is a web- and Windows-based client application for analyzing data and creating analytic dashboards. This instructor-led, live training (online or onsite) is aimed at business analysts and data analysts who wish to learn

AI-Driven Data Analysis with TIBCO Spotfire X

 14 hours

TIBCO Spotfire X is a business intelligence and augmented analytics tool that combines natural language, artificial intelligence (AI), and streaming data technologies. This instructor-led, live training (online or onsite) is aimed at business

Data Analysis with SQL, Python and Spotfire

 14 hours

In this instructor-led, live training, participants will learn three different approaches for accessing, analyzing and visualizing data. We start with an introduction to RDBMS databases; the focus will be on accessing and querying an Oracle database

Data Visualization

 28 hours

This course is intended for engineers and decision makers working in data mining and knowledge discovery. You will learn how to create effective plots and ways to present and represent your data in a way that will appeal to the decision makers

Introduction to Data Visualization with R

 28 hours

This course is intended for data engineers, decision makers and data analysts. It will lead you to create very effective plots using RStudio that appeal to decision makers and help them find out hidden information and take the right

Octave not only for programmers

 21 hours

This course is for those who would like to learn an alternative to the commercial MATLAB package. The three-day training provides comprehensive information on moving around the environment and performing the OCTAVE package for data

Apache Druid for Real-Time Data Analysis

 21 hours

Apache Druid is an open-source, column-oriented, distributed data store written in Java. It was designed to quickly ingest massive quantities of event data and execute low-latency OLAP queries on that data. Druid is commonly used in business

ELK: Elasticsearch, Logstash and Kibana for Administrators

 14 hours

This instructor-led, live training is aimed at system administrators who wish to set up an ELK stack (Elasticsearch, Logstash, Kibana). Note that a minimum of 3 delegates is needed for this course to run. The training starts with a discussion of

Embedding Projector: Visualizing Your Training Data

 14 hours

Embedding Projector is an open-source web application for visualizing the data used to train machine learning systems. Created by Google, it is part of TensorFlow. This instructor-led, live training introduces the concepts behind Embedding

Monitoring with Grafana

 14 hours

Grafana is a visualization and metric analytic suite. Grafana presents a visual representation of real-time data for application analytics. Grafana is most commonly used for home automation, process control, and more. The aim of the course is to

Grafana and Graphite

 14 hours

Grafana is an open source visualization and metric analytic suite for visualizing real-time data of application analytics. Graphite is open source software for monitoring and graphing computer system data. Together with Grafana and Graphite, users

Kibana: Essentials

 14 hours

This training introduces Kibana to the users of Elasticsearch. Kibana is an open source analytics and visualization platform designed to work with Elasticsearch. You use Kibana to search, view, and interact with data stored in Elasticsearch

TIBCO for Developers

 21 hours

TIBCO is a software suite for integrating, managing, and monitoring enterprise applications and information delivery. Two of its products, BusinessEvents (business rules engine) and ActiveMatrix Service Bus (lightweight enterprise service bus), are

TIBCO Statistica

 14 hours

TIBCO Statistica is an advanced analytics system that provides data analysis, data management, data visualization, data mining, statistics, and machine learning solutions for businesses. This instructor-led, live training (online or onsite) is