Course Outline

Introduction to Data mining and Machine Learning

  • Statistical learning vs. Machine learning
  • Iteration and evaluation
  • Bias-Variance trade-off

Regression

  • Linear regression
  • Generalizations and Nonlinearity
  • Exercises

Classification

  • Bayesian refresher
  • Naive Bayes
  • Dicriminant analysis
  • Logistic regression
  • K-Nearest neighbors
  • Support Vector Machines
  • Neural networks
  • Decision trees
  • Exercises

Cross-validation and Resampling

  • Cross-validation approaches
  • Bootstrap
  • Exercises

Unsupervised Learning

  • K-means clustering
  • Examples
  • Challenges of unsupervised learning and beyond K-means

Advanced topics

  • Ensemble models
  • Mixed models
  • Boosting
  • Examples

Multidimensional reduction

  • Factor Analysis
  • Principal Component Analysis
  • Examples

Requirements

This course is part of the Data Scientist skill set (Domain: Analytical Techniques and Methods)

  14 Hours
 

Testimonials

Related Courses

Data Mining with Weka

  14 hours

AdaBoost Python for Machine Learning

  14 hours

Machine Learning with Random Forest

  14 hours

Machine Learning for Mobile Apps using Google’s ML Kit

  14 hours

DataRobot

  7 hours

Artificial Intelligence (AI) with H2O

  14 hours

H2O AutoML

  14 hours

AutoML with Auto-sklearn

  14 hours

AutoML with Auto-Keras

  14 hours

AutoML

  14 hours

Google Cloud AutoML

  7 hours

RapidMiner for Machine Learning and Predictive Analytics

  14 hours

Pattern Recognition

  21 hours

Pattern Matching

  14 hours

Apache SystemML for Machine Learning

  14 hours