Natural Language Processing

This series of modules introduces learners to natural language processing (NLP).

Check back soon for more details.

Learning Objectives

  1. Enumerate various applications of NLP technology
  2. Perform basic data cleaning tasks, including tokenization, case normalization, punctuation, stop word removal, and stemming and lemmatization.
  3. Describe the classical approaches to text representation
  4. Implement one-hot encoding, term frequencies, and the TD-IDF method in code.
  5. Describe the basic workings of word embeddings (vector) algorithms
  6. Implement the Word2Vec algorithm in code
  7. Discuss the function of the various Word2Vec hyperparameters
  8. Recognize that a given text data set may contain implicit bias