Hadoop with Python Training Course

Hadoop is a popular Big Data processing framework. Python is a high-level programming language famous for its clear syntax and code readibility.

In this instructor-led, live training, participants will learn how to work with Hadoop, MapReduce, Pig, and Spark using Python as they step through multiple examples and use cases.

By the end of this training, participants will be able to:

Understand the basic concepts behind Hadoop, MapReduce, Pig, and Spark
Use Python with Hadoop Distributed File System (HDFS), MapReduce, Pig, and Spark
Use Snakebite to programmatically access HDFS within Python
Use mrjob to write MapReduce jobs in Python
Write Spark programs with Python
Extend the functionality of pig using Python UDFs
Manage MapReduce jobs and Pig scripts using Luigi

Audience

Developers
IT Professionals

Format of the course

Part lecture, part discussion, exercises and heavy hands-on practice

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Testimonials (5)

Trainer's preparation & organization, and quality of materials provided on github.

Mateusz Rek - MicroStrategy Poland Sp. z o.o.

Course - Impala for Business Intelligence

practical things of doing, also theory was served good by Ajay

Dominik Mazur - Capgemini Polska Sp. z o.o.

Course - Hadoop Administration on MapR

Liked very much the interactive way of learning.

Luigi Loiacono

Course - Data Analysis with Hive/HiveQL

I mostly liked the trainer giving real live Examples.

Simon Hahn

Course - Administrator Training for Apache Hadoop

The fact that all the data and software was ready to use on an already prepared VM, provided by the trainer in external disks.

vyzVoice

Course - Hadoop for Developers and Administrators

Upcoming Courses

Hadoop with Python Training Course

Course Outline

Requirements

Testimonials (5)

Mateusz Rek - MicroStrategy Poland Sp. z o.o.

Course - Impala for Business Intelligence

Dominik Mazur - Capgemini Polska Sp. z o.o.

Course - Hadoop Administration on MapR

Luigi Loiacono

Course - Data Analysis with Hive/HiveQL

Simon Hahn

Course - Administrator Training for Apache Hadoop

vyzVoice

Course - Hadoop for Developers and Administrators

Upcoming Courses

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Hadoop with Python Training Course

Course Outline

Requirements

Testimonials (5)

Mateusz Rek - MicroStrategy Poland Sp. z o.o.

Course - Impala for Business Intelligence

Dominik Mazur - Capgemini Polska Sp. z o.o.

Course - Hadoop Administration on MapR

Luigi Loiacono

Course - Data Analysis with Hive/HiveQL

Simon Hahn

Course - Administrator Training for Apache Hadoop

vyzVoice

Course - Hadoop for Developers and Administrators

Upcoming Courses

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Hadoop with Python

Related Courses

Hortonworks Data Platform (HDP) for Administrators

Apache Ambari: Efficiently Manage Hadoop Clusters

Impala for Business Intelligence

Data Analysis with Hive/HiveQL

Hadoop Administration

Course goal:

Administrator Training for Apache Hadoop

Audience:

Goal:

Hadoop Administration on MapR

Audience:

Hadoop for Developers (4 days)

Advanced Hadoop for Developers

HBase for Developers

Hadoop For Administrators

Audience

Format

Hadoop for Developers and Administrators

Hadoop for Project Managers

Samza for Stream Processing

Datameer for Data Analysts

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites