Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Original data source? #22

Closed
NickCrews opened this issue Feb 16, 2022 · 3 comments
Closed

Original data source? #22

NickCrews opened this issue Feb 16, 2022 · 3 comments

Comments

@NickCrews
Copy link

Hi! I'm trying to create a Nickname database, similar to https://github.com/carltonnorthern/nickname-and-diminutive-names-lookup, but I'd like it to be based on actual real world data. I think that the facebook data dump could be useful for this, if I could link multiple accounts to having two different names, e.g. FB ID 12345 with a name of "Stephen" in one place and "Steve" in another. Not sure if the data is actually formatted in a way that would be useful. Could you explain where you got the data dump?

@philipperemy
Copy link
Owner

philipperemy commented Feb 16, 2022

@NickCrews It's here:

Regarding Nicknames I didn't keep them but I can show you how the original data source looks like:

https://pastebin.com/r2UbU82J

I pasted 100 lignes containing the world "Stephen"

@NickCrews
Copy link
Author

Thanks @philipperemy! This is helpful. Actually, if there is only one FB ID in the dataset (which is looks like there is, and which makes sense if the scrape happened at one single.point in time), then the technique I was thinking of wouldn't work. But thanks anyways, maybe I can think of another method. 😊

@philipperemy
Copy link
Owner

@NickCrews Good luck!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants