Skip to content

Latest commit

 

History

History
22 lines (16 loc) · 732 Bytes

README.md

File metadata and controls

22 lines (16 loc) · 732 Bytes

Twitter Information Retrieval Framework

The project code is under the Twitter-Information-Retrieval-Project directory

The report is under Paper directory

###To run this code the following libs are needed:

  • stanford-corenlp-3.7.0-models.jar

This project comes with two data example for the indexes and the twittes. They are tweetsEN and tweetsEN20 for the tweets and Indexes and Indexes20 for the indexes

###There are test classes for all the main modules as follows:

  1. TestTokenizer.java
  2. TestStopWordsRemover.java
  3. TestEnglishLemmatisation.java
  4. TestPreprocessEnglish.java
  5. TestVocabulary.java
  6. TestCrawler.java
  7. TestIndexer.java
  8. TestSearch.java

Link to the todo list