Skip to content

Team 2b2|!2b's project for the HackFMI8 hackathon

Notifications You must be signed in to change notification settings

emil-kirilov/lexis

Repository files navigation

Lexis

Team 2b|!2b's project for the HackFMI8 hackathon

###Dataset mining - more than 150k unique texts

  • Downloaded more than 100k tweets using Twitter API
  • Additional 50k from other sources
  • All data is labeled

###Data preprocessing

  • removed all hashtags, links, user mentions, retweets
  • removed meaningless data
  • removed all stopwords

###Algorithm

  • "Bag of Words" - vectorization
  • Implemented different classification algorithms (SVC, Naive Bayes)
  • Compared and tuned the result
  • Get result of sample input and graph the probabilities
  • Find how to export and import classifiers
  • API - Python
  • GUI - HTML and JS

About

Team 2b2|!2b's project for the HackFMI8 hackathon

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published