-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compare patch recipients vs. get_maintainer.pl
recommendation
#21
Comments
On small scale (for getting started): To start understanding what needs to be done, we can just quickly run this investigation on a few hundred patches (selected from one week of an release candidate, where we checked or can just assume the MAINTAINERS file to be not changing). Doing this at large scale will be more difficult because we would really like to collect this information for every patch emails and there are millions of patch emails, so we need to tweak and tune the operation. |
Since this issue has been opened @rralf has created the LinuxMaintainers class. Can we leverage that instead of |
Aah after going over the codebase a bit more I see the module |
A classification approach : Please object me if this solution is inappropriate. |
@ShubhamPandey28 I fear you are providing such a dense description that we are not able to judge if this makes sense. How about starting with writing a script to identify significant systematic differences between get_maintainers.pl and email recipients? If you need to use vectors, go ahead, but you would really need to first describe your model. |
I have observed that in some cases, the authors of a patch are mailing the patches to addresses/recipients which are not present in the MAINTAINER file. In such a scenario, it is almost impossible to suggest the appropriate mailing list by running get_maintainer.pl script. Is there any possible solution to this problem? |
@quantum109678 Well, you can just add that person to the MAINTAINERS file, right? The real question is how can you train a "suitable" model on a set of files in a directory and keywords in patches for recipients and then determine the "minimal invasive" but "maximal effective" change in the MAINTAINERS file. I would be surprised if some crazy machine learning could solve that, but let us start easy and collect where there is a systematic difference, e.g., in 80% of the patches, between the recipient list of all patches belonging to a MAINTAINERS section and the information in the MAINTAINERS section. That probably already indicates that the entry might need adjustment. |
I have formulated issue #65 as nice side investigation to this question here. |
Motivation:
If we find systematic shortcomings between to whom the patches are usually sent to and
get_maintainer.pl
recommendation, we can improveget_maintainer.pl
in this regard.The text was updated successfully, but these errors were encountered: