Course Outline

Introduction

  • Overview of Databricks and Apache Spark
  • Understanding the Databricks architecture

Getting Started

  • Setting up the Environment
  • Setting up and configuring Databricks
  • Navigating the Databricks user interface
  • Creating a Databricks workspace

Working with Data in Databricks

  • Connecting to an Apache Spark data source
  • Understanding the basics columns and datatypes
  • Managing file system into Notebooks

Managing Jobs and Clusters

  • Creating and configuring clusters
  • Creating jobs using Notebook
  • Running jobs
  • Viewing jobs and job details

Using Delta Lake in Databricks

  • Loading data into Delta Lake
  • Managing data in Delta Lake

Securing Databricks

  • Managing Databricks security
  • Managing backup and recovery

Troubleshooting

Summary and Next Steps

Requirements

  • Basic understanding of data analytics
  • Knowledge of Apache Spark

Audience

  • Data Engineers
  • Data Scientists
  • Developers
  14 Hours
 

Testimonials

Related Courses

A Practical Introduction to Data Analysis and Big Data

  35 hours

Datameer for Data Analysts

  14 hours

Excel For Statistical Data Analysis

  14 hours

Data and Analytics - from the ground up

  42 hours

NLP: Natural Language Processing with R

  21 hours

SQL Advanced level for Analysts

  21 hours

Data Analytics With R

  21 hours

Elasticsearch for Developers

  14 hours

Data Analysis with Hive/HiveQL

  7 hours

Embedding Projector: Visualizing Your Training Data

  14 hours

kdb+ and q: Analyze Time Series Data

  21 hours

MATLAB Fundamentals, Data Science & Report Generation

  35 hours

Data Analysis with Python, Pandas, and Numpy

  14 hours

Knowledge Discovery in Databases (KDD)

  21 hours

Apache Kylin: From Classic OLAP to Real-Time Data Warehouse

  14 hours