Course Outline

Introduction

Sqoop and Flume Overview

  • What is Sqoop?
  • What is Flume?
  • Sqoop and Flume features

Preparing the Development Environment

  • Installing and configuring Apache Sqoop
  • Installing and configuring Apache Flume

Apache Flume

  • Creating an agent
  • Using spooling directory sources, file channels, and logger sinks
  • Working with events
  • Accessing data sources

Apache Sqoop

  • Importing data from MySQL into HDFS and Hive
  • Using Sqoop jobs

Data Ingestion Pipelines

  • Building pipelines
  • Fetching data
  • Ingesting data into HDFS

Summary and Conclusion

Requirements

  • Experience with SQL

Audience

  • Software Engineers

Duration

  • 7 Hours
