Skip to content

Terms of service agreements were used, in lieu of actual content on the web, as a dataset to learn more about writing algorithms.

License

Notifications You must be signed in to change notification settings

ck-earlyrepos/Terms-of-Use

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

64 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Terms-of-Use

This project started as an exploration into the terms of service agreement employed by various tech companies but morphed into an applied algorithm-writing exercise. The initial concept of this project dawned on me after frustrtations with company policies restricting the ability of people to scrape information from their websites. Even if I'm not allowed to access their data, I can still at least study their terms of service, right?

Output

You can see the output here. Note that the links will only work if you download the PDF.

Algorithms Written 'By-Hand'

  • word2vec

  • porter stemming algorithm

  • ngrams tokenization

  • TF-IDF statistic

  • singular value decomposition

About

Terms of service agreements were used, in lieu of actual content on the web, as a dataset to learn more about writing algorithms.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages