Course Outline

  1. Distributed big data
    1. Data mining methods (single-machine training + distributed prediction: traditional machine learning algorithms + MapReduce distributed prediction)
    2. Apache Spark MLlib
  2. Recommendations and Advertising
    1. Natural language
    2. Text clustering, text categorization (labeling), synonyms
    3. User profile reconstruction, labeling system
    4. Recommendation algorithms
    5. Ensuring the accuracy of "lift" between and within categories
    6. How to create closed loops for recommendation algorithms
  3. Logistic regression, RankingSVM
  4. Feature recognition (deep learning and automatic feature recognition for images)
  5. Natural language processing
    1. Chinese word segmentation
    2. Topic model (text clustering)
    3. Text classification
    4. Keyword extraction
    5. Semantic analysis, semantic parser, word2vec (word to vector)
    6. RNN long short-term memory (LSTM) architecture
  21 Hours
 

Related Courses

Artificial Intelligence Overview

 7 hours

This course has been created for managers, solutions architects, innovation officers, CTOs, software architects and anyone who is interested in an overview of applied artificial intelligence and the nearest forecast for its

OpenNLP for Text Based Machine Learning

 14 hours

The Apache OpenNLP library is a machine learning based toolkit for processing natural language text. It supports the most common NLP tasks, such as language detection, tokenization, sentence segmentation, part-of-speech tagging, named entity

Python: Machine Learning with Text

 21 hours

In this instructor-led, live training, participants will learn how to use the right machine learning and NLP (Natural Language Processing) techniques to extract value from text-based data. By the end of this training, participants will be able

Text Summarization with Python

 14 hours

In Python Machine Learning, the Text Summarization feature is able to read the input text and produce a text summary. This capability is available from the command-line or as a Python API/Library. One exciting application is

Python for Natural Language Generation

 21 hours

Natural language generation (NLG) refers to the production of natural language text or speech by a computer. In this instructor-led, live training, participants will learn how to use Python to produce high-quality natural language text by

Natural Language Processing

 21 hours

This course has been designed for people interested in extracting meaning from written English text, though the knowledge can be applied to other human languages as well. The course will cover how to make use of text written by humans, such as

NLP: Natural Language Processing with R

 21 hours

It is estimated that unstructured data accounts for more than 90 percent of all data, much of it in the form of text. Blog posts, tweets, social media, and other digital publications continuously add to this growing body of data. This

Natural Language Processing with Deep Dive in Python and NLTK

 35 hours

By the end of the training, delegates are expected to be equipped with the essential Python concepts and able to use NLTK to implement most NLP- and ML-based operations. The training is aimed at giving

Natural Language Processing with Python

 28 hours

This course introduces linguists or programmers to NLP in Python. During this course we will mostly use NLTK (Natural Language Toolkit, nltk.org), but we will also use other libraries relevant to NLP. At the moment we can conduct this

Scaling Data Pipelines with Spark NLP

 14 hours

Spark NLP is an open source library, built on Apache Spark, for natural language processing with Python, Java, and Scala. It is widely used for enterprise and industry verticals, such as healthcare, finance, life science, and recruiting. This

Apache Spark MLlib

 35 hours

MLlib is Spark’s machine learning (ML) library. Its goal is to make practical machine learning scalable and easy. It consists of common learning algorithms and utilities, including classification, regression, clustering, collaborative

NLP with Python and TextBlob

 14 hours

TextBlob is a Python NLP library for processing textual data. It provides a simple API that makes it easy to perform NLP tasks, such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, etc. This

Natural Language Processing with TensorFlow

 35 hours

TensorFlow™ is an open source software library for numerical computation using data flow graphs. SyntaxNet is a neural-network Natural Language Processing framework for TensorFlow. Word2Vec is used for learning vector representations of

NLP with Deeplearning4j

 14 hours

Deeplearning4j is an open-source, distributed deep-learning library written for Java and Scala. Integrated with Hadoop and Spark, DL4J is designed to be used in business environments on distributed GPUs and CPUs. Word2Vec is a method of computing