Map users with the same email to one person #78

Closed
connorshea opened this issue May 5, 2016 · 4 comments

Comments

@connorshea

My commits are listed as one of two names:

  • Author: connorshea <connor.james.shea@gmail.com>
  • Author: Connor Shea <connor.james.shea@gmail.com>

These are listed as separate users but have the same email address. Unless I'm forgetting an edge case, I don't see any reason for these not to be listed under the same user.

We use a fork of this for GitLab Contributors, so that's where I noticed it.

Thanks for the awesome app :D

@fxn
Member

fxn commented May 5, 2016

This could be a good idea to explore indeed.

I guess it would go like this: if Commit#extract_candidates decides to fall back to the commit author, and the email is already present in the commits table, return the canonical name assigned to that one.

The only gotcha that comes to mind is that a few emails like "carlhuda@engineyard.com" map to two contributors. But these are known, so maybe we could work around that.
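
A minimal sketch of that idea in plain Ruby, not the application's actual code. `CommitRow`, `KNOWN_SHARED_EMAILS`, and `name_for_author` are hypothetical names used only for illustration, and an in-memory array stands in for the commits table:

```ruby
# Hypothetical stand-in for a row of the commits table: the author email
# and the canonical contributor name already assigned to that commit.
CommitRow = Struct.new(:author_email, :canonical_name)

# Hypothetical list of emails known to be shared by several contributors.
KNOWN_SHARED_EMAILS = ["carlhuda@engineyard.com"].freeze

# Returns the canonical name already assigned to a commit with the same
# author email. Falls back to the raw author name when the email is known
# to be shared or has not been seen before.
def name_for_author(author_name, author_email, existing_rows)
  return author_name if KNOWN_SHARED_EMAILS.include?(author_email)

  match = existing_rows.find { |row| row.author_email == author_email }
  match ? match.canonical_name : author_name
end

rows = [CommitRow.new("connor.james.shea@gmail.com", "Connor Shea")]
name_for_author("connorshea", "connor.james.shea@gmail.com", rows)
name_for_author("Someone Else", "someone@example.com", rows)
```

Under these assumptions the first call would resolve to the existing canonical name and the second would keep the raw author name.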

Would you like to work on it?

@fxn
Member

fxn commented May 6, 2016

Actually, it might be more complicated. The application does not know which heuristic was applied to the existing commit. For example, the commit could have a hard-coded rule associating it with someone else, or the commit message could have overridden the author, or the commit could have several authors associated with it. In those cases this check won't yield a correct association.

@fxn
Member

fxn commented May 10, 2016

I have seen more counterexamples.

For example, the email "ronnie@dio.jp" appears with the authors "Akira Matsuda", "Akira Matsuda + Koichi Sasada", "Akira Matsuda & Yukiko Kawamoto", "Akira Matsuda & Kohei Suzuki", "Akira Matsuda & saya", "Akira Matsuda & Naoto Koshikawa". The email "sean@thoughtbot.com" maps to "Sean Griffin", and also to "@schneems and @sgrif".

There are quite a few examples of email addresses from which you cannot automatically assign a canonical name.
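
To illustrate the problem, here is a rough sketch in plain Ruby that groups (email, author) pairs and reports the emails seen with more than one distinct author string. The method name `emails_with_multiple_authors` and the array-of-pairs input are assumptions made for this example, not part of the application:

```ruby
# Returns a hash of email => distinct author strings, restricted to the
# emails that appear with more than one distinct author string.
def emails_with_multiple_authors(pairs)
  pairs.group_by(&:first)
       .transform_values { |rows| rows.map(&:last).uniq }
       .select { |_email, names| names.size > 1 }
end

# Sample pairs taken from the examples in this thread.
pairs = [
  ["connor.james.shea@gmail.com", "connorshea"],
  ["connor.james.shea@gmail.com", "Connor Shea"],
  ["sean@thoughtbot.com", "Sean Griffin"],
  ["sean@thoughtbot.com", "@schneems and @sgrif"]
]

emails_with_multiple_authors(pairs).each do |email, names|
  puts "#{email}: #{names.join(' | ')}"
end
```

With this sample both emails are reported: one is a single person under two spellings and the other covers a pairing. The email alone does not distinguish those cases.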

I think this is trickier than it seems. For now, the app is going to keep ignoring emails and work based only on names.

@fxn fxn closed this as completed May 10, 2016
@connorshea
Author

Fair enough, thanks for looking into it.
