Course Outline

Introduction

Sqoop and Flume Overview

  • What is Sqoop?
  • What is Flume?
  • Sqoop and Flume features

Preparing the Development Environment

  • Installing and configuring Apache Sqoop
  • Installing and configuring Apache Flume

Apache Flume

  • Creating an agent
  • Using spool sources, file channels, and logger sinks
  • Working with events
  • Accessing data sources
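
As a rough illustration of the agent topics above, the sketch below generates a Flume properties file that wires a spooling-directory source to a file channel and a logger sink. The agent and component names (agent1, src1, ch1, sink1) and all directory paths are hypothetical placeholders, not values from the course; the property keys are the standard Flume ones as best understood here.

```python
def flume_agent_config(agent, spool_dir, checkpoint_dir, data_dir):
    """Return Flume properties text: spooldir source -> file channel -> logger sink.

    Component names src1, ch1 and sink1 are illustrative assumptions.
    """
    lines = [
        # Declare the components that belong to this agent.
        f"{agent}.sources = src1",
        f"{agent}.channels = ch1",
        f"{agent}.sinks = sink1",
        # Spooling-directory source: picks up files dropped into spool_dir.
        f"{agent}.sources.src1.type = spooldir",
        f"{agent}.sources.src1.spoolDir = {spool_dir}",
        f"{agent}.sources.src1.channels = ch1",
        # File channel: buffers events on disk between source and sink.
        f"{agent}.channels.ch1.type = file",
        f"{agent}.channels.ch1.checkpointDir = {checkpoint_dir}",
        f"{agent}.channels.ch1.dataDirs = {data_dir}",
        # Logger sink: writes each event to the agent's log.
        f"{agent}.sinks.sink1.type = logger",
        f"{agent}.sinks.sink1.channel = ch1",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical paths chosen only for the example.
config = flume_agent_config(
    "agent1",
    "/tmp/flume/spool",
    "/tmp/flume/checkpoint",
    "/tmp/flume/data",
)
print(config)
```

Saved to a file such as example.conf (an assumed name), a configuration like this would typically be started with `flume-ng agent --conf conf --conf-file example.conf --name agent1`.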

Apache Sqoop

  • Importing MySQL data into HDFS and Hive
  • Using Sqoop jobs
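
To make the Sqoop topics above more concrete, here is a minimal sketch that assembles the command lines for a MySQL import into HDFS, an import into Hive, and a saved Sqoop job. The JDBC URL, user name, password file, table, directory, and job names are all hypothetical; the script only builds and prints the commands and does not run Sqoop.

```python
import shlex


def sqoop_import_args(jdbc_url, username, password_file, table,
                      target_dir=None, hive_table=None, num_mappers=1):
    """Build the argument list for the `sqoop import` tool (without `sqoop`)."""
    args = [
        "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,  # avoids a password on the command line
        "--table", table,
        "--num-mappers", str(num_mappers),
    ]
    if target_dir:
        # Plain HDFS import into the given directory.
        args += ["--target-dir", target_dir]
    if hive_table:
        # Load the imported data into a Hive table as well.
        args += ["--hive-import", "--hive-table", hive_table]
    return args


def sqoop_job_create(job_name, import_args):
    """Build `sqoop job --create`; a bare `--` separates job options from the tool."""
    return ["sqoop", "job", "--create", job_name, "--"] + import_args


# All connection details below are made-up placeholders.
URL = "jdbc:mysql://db.example.com/shop"

hdfs_cmd = ["sqoop"] + sqoop_import_args(
    URL, "etl_user", "/user/etl/.mysql-pass", "orders",
    target_dir="/data/raw/orders",
)
hive_cmd = ["sqoop"] + sqoop_import_args(
    URL, "etl_user", "/user/etl/.mysql-pass", "orders",
    hive_table="shop.orders",
)
job_cmd = sqoop_job_create("orders_import", hdfs_cmd[1:])

for cmd in (hdfs_cmd, hive_cmd, job_cmd):
    print(shlex.join(cmd))
```

A job created this way would later be run with `sqoop job --exec orders_import`, which is what makes saved jobs convenient for repeated imports.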

Data Ingestion Pipelines

  • Building pipelines
  • Fetching data
  • Ingesting data into HDFS

Summary and Conclusion

Requirements

  • Experience with SQL

Audience

  • Software Engineers

Duration

  • 7 hours

Related Courses

Data Virtualization with Denodo Platform

  14 hours

Apache Airflow

  21 hours

Apache Arrow for Data Analysis across Disparate Data Sources

  14 hours

Apache Hama

  14 hours

Zeppelin for Interactive Data Analytics

  14 hours

Apache Accumulo Fundamentals

  21 hours

Apache Kylin: From Classic OLAP to Real-Time Data Warehouse

  14 hours

Dremio for Self-Service Data Analysis

  21 hours

Apache Drill

  21 hours

Apache Drill Performance Optimization and Debugging

  7 hours

Apache Drill Query Optimization

  7 hours

Data Vault: Building a Scalable Data Warehouse

  28 hours

Big Data & Database Systems Fundamentals

  14 hours

Apache Druid for Real-Time Data Analysis

  21 hours