Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added functionality for extracting distinct keywords from list of strings #1

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

Viole-Grace
Copy link

changes:

  • added function to abstract running list of input strings or a single string
  • optimised performance by incorporating nlp.pipe() to form doc, which grants speed advantages for longer doc / large number of input strings
  • updated testing notebook with examples of multiple string extraction
  • updated readme to reflect latest changes for single string keyword extraction + multiple string keyword extraction

@sahyagiri
Copy link
Owner

sahyagiri commented Feb 22, 2022

Thanks a lot for the pull request. At present I plan to keep the API simple with single document and single set of keywords.

  • For multiple documents, can you create a wrapper function, say "distinct_keywords.get_multiple_doc_keywords(docs) ?
  • Keep the example notebook minimal
  • I would welcome any step for single document keyword extraction, that is making it faster and consistent over larger documents

@Viole-Grace
Copy link
Author

changes:

  • added separate functions for running a list of input strings or a single string
  • optimised performance by incorporating nlp.pipe() to form doc, which grants speed advantages for longer doc / large number of input strings for both functions
  • updated testing notebook with examples of multiple string extraction
  • updated readme to reflect latest changes for single string keyword extraction + multiple string keyword extraction

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants