Map users with the same email to one person #78
Comments
This could be a good idea to explore indeed. I guess it would go like this: if Commit#extract_candidates decides to fall back to the commit author, and the email is already present in the commits table, return the canonical name assigned to that one. The only gotcha that comes to mind is that a few emails like "carlhuda@engineyard.com" map to two contributors. But these are known, so maybe we could work around that. Would you like to work on it?
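A rough sketch of that fallback, in plain Ruby rather than the app's actual models. `KNOWN_SHARED_EMAILS`, `canonical_name_for`, and the in-memory `commits` array are hypothetical stand-ins for the commits table and its lookup; only the email addresses come from this thread.

```ruby
require "set"

# Hypothetical list of addresses known to be shared by more than one person.
KNOWN_SHARED_EMAILS = Set["carlhuda@engineyard.com"].freeze

# commits: array of hashes standing in for rows of the commits table,
# each with :author_email and an already assigned :canonical_name.
# Returns the canonical name previously assigned to this email, or nil
# when the email is unknown or is on the shared list.
def canonical_name_for(email, commits)
  return nil if KNOWN_SHARED_EMAILS.include?(email)

  match = commits.find { |c| c[:author_email] == email }
  match && match[:canonical_name]
end

commits = [
  { author_email: "connor.james.shea@gmail.com", canonical_name: "Connor Shea" }
]

canonical_name_for("connor.james.shea@gmail.com", commits) # => "Connor Shea"
canonical_name_for("carlhuda@engineyard.com", commits)     # => nil
```

Returning nil lets the caller keep using the existing name-based heuristics whenever the email gives no safe answer.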
Actually, it might be more complicated. The application does not know which heuristic was applied to the existing commit. For example, the commit could have a hard-coded rule associating it with someone else, or the commit message could have overridden the author, or the commit could have several authors associated. In those cases this check won't yield a correct association.
I have seen more counterexamples. For example, the email "ronnie@dio.jp" has as authors "Akira Matsuda", "Akira Matsuda + Koichi Sasada", "Akira Matsuda & Yukiko Kawamoto", "Akira Matsuda & Kohei Suzuki", "Akira Matsuda & saya", "Akira Matsuda & Naoto Koshikawa". The email "sean@thoughtbot.com" maps to "Sean Griffin", and also to "@schneems and @sgrif". There are quite a few examples of email addresses from which you cannot automatically assign a canonical name. I think this is trickier than it seems. For now, the app is going to keep ignoring emails and work only based on names.
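To make the counterexamples concrete, here is a small sketch that groups credited names by email and flags every address with more than one distinct name. The `rows` array is a hypothetical flattening of commit data into email and name pairs, and `names_by_email` and `ambiguous_emails` are illustrative helpers, not part of the app. The "solo@example.com" row is made up for contrast.

```ruby
# rows: [email, credited_name] pairs, a hypothetical flattening of commits
# and the names credited for them.
def names_by_email(rows)
  rows.group_by(&:first).transform_values { |pairs| pairs.map(&:last).uniq }
end

# Emails credited to more than one distinct name cannot be resolved
# to a single canonical name from the email alone.
def ambiguous_emails(rows)
  names_by_email(rows).select { |_email, names| names.size > 1 }.keys
end

rows = [
  ["ronnie@dio.jp", "Akira Matsuda"],
  ["ronnie@dio.jp", "Akira Matsuda + Koichi Sasada"],
  ["sean@thoughtbot.com", "Sean Griffin"],
  ["sean@thoughtbot.com", "@schneems and @sgrif"],
  ["solo@example.com", "Solo Author"] # made-up unambiguous row
]

ambiguous_emails(rows) # => ["ronnie@dio.jp", "sean@thoughtbot.com"]
```

A check like this only surfaces the problem. It cannot decide which of the names is the right one for a new commit.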
Fair enough, thanks for looking into it.
My commits are listed as one of two names:
Author: connorshea <connor.james.shea@gmail.com>
Author: Connor Shea <connor.james.shea@gmail.com>
These are listed as separate users but have the same email address. Unless I'm forgetting an edge case, I don't see any reason for these not to be listed under the same user.
We use a fork of this for GitLab Contributors, so that's where I noticed it.
Thanks for the awesome app :D