-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spanish language pack #82
Conversation
Hey, thanks for your contribution with the new language package. I appreciate your efforts with the list but could you add the sources to the list generator like descript in the documentation |
It was all done manually as our government doesn't provide with an easy format to extract from. Is it OK if I create a repo with the raw files and add the information in the repo from where and how they were extracted? I will correct the rest of naming errors. |
I found something for the generators. I'll work on that. |
I would be fine if you create a github repository and explain where you got them from. |
The generators work correctly but we should add some pre-processing function to filter compound names. Spanish (ab)uses compound first names. So instead of being named "Antonio", "Jose", "Pedro" your parents choose to call you "Antonio Manuel", "Jose Miguel" or "Pedro Antonio" to honour(keep happy) other family members. And as this is quite popular the list of first names given by the government is plagued by such. For the purpose of the matcher this is useless as any of the compound names will already match the single name from the list. I'm sending a new PR for the Spanish language pack including compound names and I open a new issue on how to deal with compound names. |
As mentioned here wikipedia data is not included in my commit.
|
I'm sorry the wikipedia list needs to return an empty array 🙈 |
Done. |
No description provided.