Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
Introduction
- Spark NLP vs NLTK vs spaCy
- Overview of Spark NLP features and architecture
Getting Started
- Setup requirements
- Installing Spark NLP
- General concepts
Using Pre-trained Pipelines
- Importing required modules
- Default annotators
- Loading a pipeline model
- Transforming texts
Building NLP Pipelines
- Understanding the pipeline API
- Implementing NER models
- Choosing embeddings
- Using word, sentence, and universal embeddings
Classification and Inference
- Document classification use cases
- Sentiment analysis models
- Training a document classifier
- Using other machine learning frameworks
- Managing NLP models
- Optimizing models for low-latency inference
Troubleshooting
Summary and Next Steps
Requirements
- Familiarity with Apache Spark
- Python programming experience
Audience
- Data scientists
- Developers
14 Hours
Testimonials (5)
A lot of practical examples, different ways to approach the same problem, and sometimes not so obvious tricks how to improve the current solution
Rafał - Nordea
Course - Apache Spark MLlib
very interactive...
Richard Langford
Course - SMACK Stack for Data Science
Sufficient hands on, trainer is knowledgable
Chris Tan
Course - A Practical Introduction to Stream Processing
Get to learn spark streaming , databricks and aws redshift
Lim Meng Tee - Jobstreet.com Shared Services Sdn. Bhd.
Course - Apache Spark in the Cloud
practice tasks