Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compare "relevant" keywords between the advisory text and the commits messages #331

Closed
sacca97 opened this issue Oct 10, 2022 · 2 comments
Closed

Comments

@sacca97
Copy link
Collaborator

sacca97 commented Oct 10, 2022

Using spacy to extract nouns, propn or verbs from the advisory and the commits messages and then looking for matches seems a good start to treat difficult candidates that do not match any other rule.

This issue supersedes #327, #271 and #259.

@copernico
Copy link
Contributor

copernico commented Oct 10, 2022

I would try extracting verbs too and applying stemming (or lemmatisation).

copernico pushed a commit that referenced this issue Nov 3, 2022
- Implemented twin commits, closes #147 
- Using basemodel again (pydantic)
- Fixed the logger, improved log aesthetics and readability
- github and jira issues are collected using their respective APIs (GITHUB token required via .env file)
- Better word extraction and filtering based on tests conducted on the small dataset.
- Report handlers now create nested folders
- updated to python 3.10
- skip GitHub when fetching references
- refactored git and raw_commit modules: all commit IDs, timestamp, parent, message and changed files are
collected with a single call to git executable
- refactored logging

- Fixes:
#339  
#341  
#334  
#331  
#326 
#336
@sacca97
Copy link
Collaborator Author

sacca97 commented Nov 4, 2022

Closed by #342

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants