Skip to content

xnorax/ml_class

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 

Repository files navigation

An Introduction to Machine Learning with Web Data

Glossary

  • Stemming: Reducing words to linguistic stems.
  • parametric:

Algorithms

  • Naive Bayes

  • K Nearest Neighbor: fails in text data

  • SVM

  • Boosting: combine weak-learners to create a strong-learner

  • Agglomerative Clustering

  • Divisive Clustering

  • K Means Clustering

  • Euclidean

    for things that can be represented in euclidean space or to measure similarity in space

  • Cityblock

    for binary

  • Jaccard

    for completely random f

Hacks

  • Use wordnet to bootstrap a larger, more descriptive feature set for text data.

Resources

About

code for my O'Reilly masterclass videos

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%