Course Outline

Introduction

  • Overview of Dask features and advantages
  • Parallel computing in Python

Getting Started

  • Installing Dask
  • Dask libraries, components, and APIs
  • Best practices and tips

Scaling NumPy, SciPy, and Pandas

  • Dask arrays examples and use cases
  • Chunks and blocked algorithms
  • Overlapping computations
  • SciPy stats and LinearOperator
  • Numpy slicing and assignment
  • DataFrames and Pandas

Dask Internals and Graphical UI

  • Supported interfaces
  • Scheduler and diagnostics
  • Analyzing performance
  • Graph computation

Optimizing and Deploying Dask

  • Setting up adaptive deployments
  • Connecting to remote data
  • Debugging parallel programs
  • Deploying Dask clusters
  • Working with GPUs
  • Deploying Dask on cloud environments

Troubleshooting

Summary and Next Steps

Requirements

  • Experience with data analysis
  • Python programming experience

Audience

  • Data scientists
  • Software engineers
  14 Hours
 

Testimonials

Related Courses

Automated Monitoring with Zabbix

  14 hours

Databricks

  14 hours

Data Cleaning

  7 hours

Datadog Monitoring

  7 hours

Netdata

  7 hours

Zenoss Monitoring for Administrators

  21 hours

Fluentd for Log Data Unification

  14 hours

KNIME Analytics Platform for BI

  21 hours

Microsoft Power Platform Fundamentals

  14 hours

Monitoring Your Resources with Munin

  7 hours

Nagios

  35 hours

Nagios Core

  21 hours

Nagios XI Administration

  21 hours

Sensu: Beginner to Advanced

  14 hours

SPSS Modeler

  14 hours