Skip to content
Witold Serwatka edited this page Jul 8, 2020 · 9 revisions

Welcome to the Hyperreal wiki!
Please check the sidebar for more info on the project files.

TO-DO List

  1. datautils -> utils.py - improve it or remove it
  2. datautils -> preprocess.py - make get_data function more ubiqutous
  3. textutils -> ngrams.py - understand and improve distinctive_ngrams function
  4. textutils -> stats.py - improve get_word_freq function, find the appropriate file for it
  5. textutils -> tagging.py - add flair functionalities
  6. textutils -> utils.py - specify the get_lemma_dict file
  7. textutils -> utils.py - make use of clear_tags and find_phrase functions
  8. textutils -> utils.py - remove redundant get_other_form_dict function
  9. textutils -> word2vec.py - improve create_synonym_dict function
  10. overview -> stats.py - remove generate_word_cloud function
  11. overview -> user.py - add threading to get_drug_correlation function

Future wiki expansion

  1. Add library tutorials
  2. Add crawler description pages and tutorial
  3. Add interface description pages