Skip to content

Latest commit

 

History

History
22 lines (17 loc) · 982 Bytes

File metadata and controls

22 lines (17 loc) · 982 Bytes

Probability and Naive Bayes

Wednesday 4/30/2014 We'll be reviewing some basics of probability, developing ways to work with text data, and using a classification algorithm to classify text.

Objectives

  • Articulate Naive Bayes' advantages, flaws, applications and theoretical foundation
  • Explain how Naive Bayes is applied to classify text or Spam
  • Be familiar with using the N.B. classifiers in NLTK and SKLearn
  • Create a basic Naive Bayes classifier

Materials

Assignments

  • Add a feature to the NLTK gender classifier to try and improve performance
  • Create a classifier to tell the difference between two authors
  • Brainstorm classification topics for projects (due May 14)