Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

autor normalization: concatenated initials #30

Open
michamos opened this issue Nov 27, 2017 · 2 comments
Open

autor normalization: concatenated initials #30

michamos opened this issue Nov 27, 2017 · 2 comments

Comments

@michamos
Copy link
Contributor

From @jacquerie on August 26, 2017 23:9

From @annetteholtkamp on August 18, 2017 15:13

Expected Behavior

A name with concatenated capital initials like "Fitzakerley, DW" should be converted to
"Fitzakerley, D.W."
ex: https://inspirehep.net/record/1488707

This conversion should only happen, if the family name is not all caps since we may encounter Chinese names like "SHI, YU"

Context

Currently, cataloguers have to change this manually which is painful for long author lists and may easily be overlooked. In the current system I believe this may generate superfluous new author profiles.

Copied from original issue: inspirehep/inspire-next#2662

Copied from original issue: inspirehep/inspire-schemas#221

@michamos
Copy link
Contributor Author

From @jacquerie on August 26, 2017 23:9

From @kaplun on August 22, 2017 9:5

Is this happening in general (hence to be added to the workflow) or is only within certain sources (e.g. arXiv) (hence to be fixed in a crawler?)

@michamos
Copy link
Contributor Author

From @annetteholtkamp on August 27, 2017 9:57

Difficult to say. It surely can happen for scanned conference proceedings. I can look around a bit.

  • Annette

On 27 Aug 2017, at 01:09, Jacopo Notarstefano <notifications@github.commailto:notifications@github.com> wrote:

From @kaplunhttps://github.com/kaplun on August 22, 2017 9:5

Is this happening in general (hence to be added to the workflow) or is only within certain sources (e.g. arXiv) (hence to be fixed in a crawler?)


You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHubinspirehep/inspire-schemas#221 (comment), or mute the threadhttps://github.com/notifications/unsubscribe-auth/AM1-O8RmQCq8lnZhxPf-Z7KktqKOrh2tks5scKWcgaJpZM4PDpVt.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant