Course Outline

Introduction

Overview of Apache Spark Features and Architecture

  • Apache Spark modules: Spark SQL, Spark Streaming, MLlib, GraphX
  • RDD, Dataframes, drive-workers, DAG, etc.

Setting up Apache Spark on .NET

  • Preparing the Java VM
  • Running .NET for Apache Spark using .NET Core

Getting Started

  • Creating a sample .NET console application
  • Adding the Spark driver
  • Initializing a SparkSession
  • Executing the application

Preparing Data

  • Building a data preparation pipeline
  • Performing ETL (Extract, Transform, and Load)

Machine Learning

  • Building a machine learning model
  • Preparing the data
  • Training a model

Real-time Processing

  • Processed streaming data in real-time
  • Case study: monitoring sensor data

Interactive Query

  • Working with Spark SQL
  • Analyzing structured data

Visualizing Results

  • Plotting results
  • Using third-party tools to visualize results

Troubleshooting

Summary and Conclusion

Requirements

  • .NET programming experience using C# or F#

Audience

  • Developers
  21 Hours
 

Testimonials

Related Courses

Advanced Blazor

 21 hours

Blazor is an open-source web framework written in C# that enables developers to create interactive web user interfaces and applications using C#. Entity Framework (EF) Core is an object-database mapper for .NET that provides developers with an

Blazor WebAssembly

 14 hours

Blazor is an open-source web framework written in C# that enables developers to create interactive web user interfaces and applications using C#. This instructor-led, live training (online or onsite) is aimed at web developers who wish to use

Introduction to Blazor

 14 hours

Blazor is an open source web framework written in C#. Blazor enables developers to create interactive web user interfaces and applications using C#. This instructor-led, live training (online or onsite) is aimed at web developers who wish to use

Artificial Intelligence - the most applied stuff - Data Analysis + Distributed AI + NLP

 21 hours

This course is aimed at developers and data scientists who wish to understand and implement AI within their applications. Special focus is given to Data Analysis, Distributed AI and

Apache Spark MLlib

 35 hours

MLlib is Spark’s machine learning (ML) library. Its goal is to make practical machine learning scalable and easy. It consists of common learning algorithms and utilities, including classification, regression, clustering, collaborative

High-Performance Application Development with .NET Core

 14 hours

.NET Core is one of the major software components of the .NET development platform by Microsoft. It allows users to execute applications on multiple operating systems without the risk of regressions present in the classic .NET Framework. Among the

Introduction to .Net Core

 14 hours

.NET Core is an open-source framework for creating .NET applications on Windows, macOS and Linux. In this instructor-led, live training, participants will learn the fundamentals of .Net Core as they step through the creation of a sample

.NET Core and Angular Training Course

 21 hours

.NET Core is an open-source framework for creating .NET applications on Windows, macOS and Linux. In this instructor-led, live training, participants will learn the .Net Core as they step through the creation of a sample application that

Developing Desktop Applications with Visual Studio 2012, VB.NET and SQL Server 2012

 21 hours

This course is divided into 3 main sections and is made up of a mixture of presentations and practical exercises. VB.NET Language in Visual Studio 2012 VB.NET Object Orientation VB.NET and Sql Server

Visual Studio with VB.Net

 21 hours

Microsoft Visual Studio is an integrated development environment (IDE).  VB.NET (Visual Basic .NET) is a multi-paradigm, object-oriented programming language that is implemented on the .NET

Entity Framework Core 2.0

 14 hours

Entity Framework Core 2.0 is a lightweight, cross-platform ORM (Object Relational Model) for developing data-oriented applications. It allows developers to work with data as .Net objects and properties. In this instructor-led, live training,

Introduction to Graph Computing

 28 hours

Many real world problems can be described in terms of graphs. For example, the Web graph, the social network graph, the train network graph and the language graph. These graphs tend to be extremely large; processing them requires a specialized set

Machine Learning and AI with ML.NET

 21 hours

ML.NET is a framework for machine learning applications built upon the .NET development platform by Microsoft. It is extensible to multiple operating systems and can perform various machine learning prediction tasks. The central ML.NET tools are the

Implementing the Actor Model with Microsoft Orleans

 14 hours

Microsoft Orleans is an actor framework (similar to AKKA.net) for building a stateful middle tier. This instructor-led, live training (online or onsite) is aimed at developers who wish to use Microsoft Orleans to build distributed enterprise .NET

Python and Spark for Big Data (PySpark)

 21 hours

Python is a high-level programming language famous for its clear syntax and code readibility. Spark is a data processing engine used in querying, analyzing, and transforming big data. PySpark allows users to interface Spark with Python. In this