Skip to content

Latest commit

 

History

History
18 lines (10 loc) · 778 Bytes

README.md

File metadata and controls

18 lines (10 loc) · 778 Bytes

Terms-of-Use

This project started as an exploration into the terms of service agreement employed by various tech companies but morphed into an applied algorithm-writing exercise. The initial concept of this project dawned on me after frustrtations with company policies restricting the ability of people to scrape information from their websites. Even if I'm not allowed to access their data, I can still at least study their terms of service, right?

Output

You can see the output here. Note that the links will only work if you download the PDF.

Algorithms Written 'By-Hand'

  • word2vec

  • porter stemming algorithm

  • ngrams tokenization

  • TF-IDF statistic

  • singular value decomposition