This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mined repositories languages #18
Comments
Our crawler uses the However, due to GitHub API issues, sometimes we get repositories written in other languages than what we asked for and that may be the cause of confusion. In such cases, we stick with what we searched for and classifies such repositories under the language we filtered on. Let me know if you still have any doubts. |
My doubt was if those languages are the most present ones in decreasing order or a '(semi-)arbitrary' subset. |
Okay, they are chosen based on their popularity, so you can call it a semi-arbitrary design decision. |
Just noticed that Smalltalk/Pharo was not there, but not sure if it can be considered relevant. |
It would be nice to have such a list of languages |
Not sure if it's exactly what we were talking about but here is a list of languages apparently known to GitHub: https://github.com/github/linguist/blob/master/lib/linguist/languages.yml |
Great, I also leave this here for future references: https://madnight.github.io/githut |
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
Is there a way to see if some important languages are excluded from the mining?
I have seen the language stats report in the link 'Mined Projects'.
Are these the 13 most widespread languages and everything else is 'below Kotlin' or there are holes with widespread languages in between?
The text was updated successfully, but these errors were encountered: