Course Outline

Introduction

  • Apache Arrow vs Parquet

Installing and Configuring Apache Arrow

Overview of Apache Arrow Features and Architecture

Exploring Data with Pandas and Apache Arrow

Exploring Data with Spark and Apache Arrow

Exploring Data with R and Apache Arrow

Exploring Data with MapD and Apache Arrow

Other Data Analysis Integrations

  • PySpark, Parquet files on S3, and Oracle tables and Elasticsearch indices

Troubleshooting

Summary and Conclusion

Requirements

  • A basic undersanding of SQL
  • Familiarity with Python or R
  • Some familiarity with Apache Spark
  14 Hours
 

Testimonials

Related Courses

SPSS Modeler

  14 hours

Databricks

  14 hours

Microsoft Power Platform Fundamentals

  14 hours

PL-900T00: Microsoft Power Platform Fundamentals

  7 hours

Data Cleaning

  7 hours

Sensu: Beginner to Advanced

  14 hours

Monitoring Your Resources with Munin

  7 hours

Automated Monitoring with Zabbix

  14 hours

Fluentd for Log Data Unification

  14 hours

Nagios Core

  21 hours

Nagios

  35 hours

Nagios XI Administration

  21 hours

Advanced Nagios

  21 hours

Zenoss Monitoring for Administrators

  21 hours

Netdata

  7 hours