Twitter-Information-Retrieval

To-do List:
  • Make sure the crawler doesn't search through the same user more than once
  • Store tweets
  • Load tweets
  • Expand text
  • Resolve acronyms
  • Index tweets
    • Lexical analysis
    • Remove stopwords
    • Lemmatisation
    • Controlled vocabulary
    • Index Structure
  • Store index
  • Load index
  • Add tf-idf
  • Search
  • Write paper
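
A possible starting point for the "Index tweets", "Add tf-idf" and "Search" items above, as a minimal sketch only. It assumes tweets are held in memory as a dict of tweet id to text; the names `STOPWORDS`, `tokenize`, `build_index` and `search` are hypothetical and not part of the project. Lemmatisation, controlled vocabulary and on-disk storage are left out.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical, deliberately tiny stopword list; a real one would be much longer.
STOPWORDS = {"a", "an", "the", "is", "of", "to", "and", "in", "on"}


def tokenize(text):
    """Lexical analysis: lowercase, split into tokens, drop stopwords."""
    tokens = re.findall(r"[a-z0-9#@_']+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


def build_index(tweets):
    """Build an inverted index: term -> {tweet_id: term frequency}."""
    index = defaultdict(dict)
    for tweet_id, text in tweets.items():
        for term, tf in Counter(tokenize(text)).items():
            index[term][tweet_id] = tf
    return index


def search(index, n_docs, query):
    """Rank tweets by the sum of tf * idf over the query terms."""
    scores = defaultdict(float)
    for term in tokenize(query):
        postings = index.get(term, {})
        if not postings:
            continue  # term does not occur in any indexed tweet
        idf = math.log(n_docs / len(postings))
        for tweet_id, tf in postings.items():
            scores[tweet_id] += tf * idf
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Made-up example data, for illustration only.
tweets = {
    1: "The crawler is indexing tweets",
    2: "Search the index of tweets",
    3: "Writing the paper",
}
index = build_index(tweets)
results = search(index, len(tweets), "crawler tweets")
print(results)  # tweet 1 ranks above tweet 2; tweet 3 does not match
```

The plain `tf * log(N / df)` weighting is one common variant; a smoothed or length-normalised form may suit short tweets better.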

Link to the README