Course Outline


  • The value of text-based data

Workflow for a Text-Based Data Science Problem

Choosing the Right Machine Learning Libraries

Overview of NLP Techniques

Preparing a Dataset

Visualizing the Data

Working with Text Data with scikit-learn

Building a Machine Learning Model

Splitting into Train and Test Sets

Applying Linear Regression and Non-Linear Regression

Applying NLP Techniques

Parsing Text Data Using Regular Expressions

Exploring Other Machine Language Approaches

Troubleshooting Text Encoding Issues

Closing Remarks


  • Experience with Python
  • An understanding of machine learning
  • Experience with scikit-learn and pandas
  21 Hours


Related Courses

Data Mining with Weka

  14 hours

AdaBoost Python for Machine Learning

  14 hours

Machine Learning with Random Forest

  14 hours

Machine Learning for Mobile Apps using Google’s ML Kit

  14 hours


  7 hours

Artificial Intelligence (AI) with H2O

  14 hours

H2O AutoML

  14 hours

AutoML with Auto-sklearn

  14 hours

AutoML with Auto-Keras

  14 hours


  14 hours

Google Cloud AutoML

  7 hours

RapidMiner for Machine Learning and Predictive Analytics

  14 hours

Advanced Analytics with RapidMiner

  14 hours

Pattern Recognition

  21 hours

Pattern Matching

  14 hours