Course Outline

Introduction

Overview of Apache Spark Features and Architecture

  • Apache Spark modules: Spark SQL, Spark Streaming, MLlib, GraphX
  • RDD, Dataframes, drive-workers, DAG, etc.

Setting up Apache Spark on .NET

  • Preparing the Java VM
  • Running .NET for Apache Spark using .NET Core

Getting Started

  • Creating a sample .NET console application
  • Adding the Spark driver
  • Initializing a SparkSession
  • Executing the application

Preparing Data

  • Building a data preparation pipeline
  • Performing ETL (Extract, Transform, and Load)

Machine Learning

  • Building a machine learning model
  • Preparing the data
  • Training a model

Real-time Processing

  • Processed streaming data in real-time
  • Case study: monitoring sensor data

Interactive Query

  • Working with Spark SQL
  • Analyzing structured data

Visualizing Results

  • Plotting results
  • Using third-party tools to visualize results

Troubleshooting

Summary and Conclusion

Requirements

  • .NET programming experience using C# or F#

Audience

  • Developers
  21 Hours
 

Testimonials

Related Courses

Python and Spark for Big Data (PySpark)

  21 hours

Introduction to Graph Computing

  28 hours

Apache Spark MLlib

  35 hours

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

  21 hours

Machine Learning and AI with ML.NET

  21 hours

Introduction to Blazor

  14 hours

Blazor WebAssembly

  14 hours

Advanced Blazor

  21 hours

Implementing the Actor Model with Microsoft Orleans

  14 hours

Introduction to .Net Core

  14 hours

.NET Core and Angular Training Course

  21 hours

High-Performance Application Development with .NET Core

  14 hours

Entity Framework Core 2.0

  14 hours

Visual Studio with VB.Net

  21 hours

Developing Desktop Applications with Visual Studio 2012, VB.NET and SQL Server 2012

  21 hours