Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(#227): samples-filter integration #306

Merged
merged 3 commits into from
May 8, 2024

Conversation

h1alexbel
Copy link
Contributor

@h1alexbel h1alexbel commented May 8, 2024

@yegor256, take a look, please

I've developed samples-filter command-line tool for filtering repositories.csv. We support both models: ML model based on Random-Forest algorithm, and Transformer model. We trained them on dataset of descriptions and READMEs of public GitHub repositories. In this pr, I've introduced integration with that tool by using transformer model. Let's see how it will perform on filtering by repos description.

closes #227

@yegor256 yegor256 merged commit 471562a into yegor256:master May 8, 2024
10 checks passed
@h1alexbel h1alexbel deleted the filter-it branch May 8, 2024 15:10
@yegor256
Copy link
Owner

yegor256 commented May 8, 2024

@h1alexbel excellent work!

@yegor256
Copy link
Owner

yegor256 commented May 8, 2024

@h1alexbel would be nice to add a paragraph about it, into tex/report.tex

@h1alexbel
Copy link
Contributor Author

@yegor256 created #307 for this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

filter out repositories with samples
2 participants