Your Dataset Please #6

Open
Josephat90 opened this issue Sep 9, 2020 · 4 comments

Comments

@Josephat90

Good day,
Thank you for sharing your work; I have found it very useful in my research. However, I need some more information about your model and the dataset you used to train it. I am currently writing my dissertation and would love to give you the credit you deserve in my research.
Kindly share the name of the dataset and its details with me. You do not have to share the dataset itself.
Thank you, and best regards.

@Megatronicus

Where is the dataset? Is he just bragging?

@minto5050
Owner

The dataset consisted of NSFW images with nudity downloaded from Tumblr and SFW images collected from Google. I felt that most of the images may have been uploaded without the concent of the people in them, and that it would be wrong to publish them and violate their privacy. I kept the trained data here in case it is helpful for someone.

@Megatronicus

Just to correct you, it's spelled "consent". I was looking at "concent" and thinking what word is this now, lol.

Ok, thanks for explaining it. Best regards.

@Megatronicus

The dataset consisted of NSFW images with nudity downloaded from Tumblr and SFW images collected from Google. I felt that most of the images may have been uploaded without the concent of the people in them, and that it would be wrong to publish them and violate their privacy. I kept the trained data here in case it is helpful for someone.

Hey, is there any way you could send me the dataset by email, or upload it to file.io and share the link? I have downloaded about 930,000 images with scrapers. After removing the duplicates and corrupted images, I was left with about 130,000. I then manually went through about 5,000 of them to create a model, which I used to go through the 130,000, and now I'm weeding out the 13,000 SFW images that were among those 130,000 supposedly all-NSFW images. If I had more data, I could make a more precise model in the end. Could you send it over somehow?
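
For anyone following along, the duplicate-removal step mentioned above could look roughly like the sketch below. This is only an illustration and not necessarily the script used here: it assumes "duplicate" means byte-for-byte identical files, and it treats zero-byte files as a crude stand-in for corrupted downloads. The function names `file_digest` and `unique_files` are made up for this example.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def unique_files(paths):
    """Keep the first path seen for each distinct file content."""
    seen = set()
    kept = []
    for p in map(Path, paths):
        # Assumption: an empty file is a failed download, so skip it.
        if p.stat().st_size == 0:
            continue
        digest = file_digest(p)
        if digest not in seen:
            seen.add(digest)
            kept.append(p)
    return kept
```

Exact hashing only catches identical copies. Resized or re-encoded versions of the same image would need a perceptual hash or similar, and properly detecting corrupted images would need an image decoder, which this sketch leaves out.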
