Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add the the Fine-Grained Visual Categorization IMET 2020 dataset #115

Closed
mikayelh opened this issue Oct 19, 2020 · 19 comments
Closed

Add the the Fine-Grained Visual Categorization IMET 2020 dataset #115

mikayelh opened this issue Oct 19, 2020 · 19 comments
Assignees

Comments

@mikayelh
Copy link
Collaborator

Describe the dataset

Add IMET 2020 FGVC7 dataset to Hub. So this would work.

import hub
ds = hub.load("username/imet-2020-fgvc7")

Steps

  1. Please take a look at the docs on uploading datasets.

  2. Uploading script should be added to examples folder

Example

You can find an example of large dataset loading and upload here:

@harshitsankhla
Copy link

harshitsankhla commented Oct 19, 2020

Hi @mikayelh , please assign me to this ! Thanks :)

@mikayelh
Copy link
Collaborator Author

mikayelh commented Oct 19, 2020

Hi @harshitsankhla Thank you so much for your willingness to contribute! Let me know if you have any questions, and please star our package while you're at it.

@mikayelh
Copy link
Collaborator Author

mikayelh commented Oct 21, 2020

Hi, @harshitsankhla! Hope this finds you well. Dropping a note to check in on you an ask if you need a hand with uploading the dataset. Feel free to ask us in the GitHub Discussions (we have beta access!) or our dedicated Slack channel. Thanks a mil!

@harshitsankhla
Copy link

@mikayelh I'll have a go later in the day. If I find any problems will ping you here or on slack.

@mikayelh
Copy link
Collaborator Author

hi @harshitsankhla! Thanks a lot, let me know if you any further questions! :)

@AbhinavTuli
Copy link
Contributor

@harshitsankhla Here's a tutorial for uploading datasets using Hub that might be helpful for you!

@harshitsankhla
Copy link

@mikayelh @AbhinavTuli I've tried downloading the dataset a couple of times, unsuccessfully. The connection to the host server doesn't seem all good. Do you have any alternative locations for this ?

@mikayelh
Copy link
Collaborator Author

Hi @harshitsankhla - I'm currently looking for other sources and will update you in about two hours. It doesn't look like they've published the dataset anywhere else. I'll let you know - if we don't manage to find an alternative source, you can pick another dataset!

@mikayelh
Copy link
Collaborator Author

Hey @harshitsankhla , what browser are you using for the download? I've just managed to download the entire dataset via Chrome (it allows you to resume the download if something fails). I would be happy to re-upload it, but it's going to take about 10 hours on my internet connection. Can you please give it a try via Chrome and let me know if it works?

@harshitsankhla
Copy link

I was using safari and the kaggle api. I'll try with chrome now.

@mikayelh
Copy link
Collaborator Author

mikayelh commented Oct 25, 2020

In any case, I've already kickstarted the upload for you to our Google Drive, but it will take quite some time - just let me know how it goes! :)

@davidbuniat
Copy link
Member

@harshitsankhla happy to provide access to an AWS instance for you to download the data and do the processing. Feel free to join our slack channel and claim one

@mikayelh mikayelh added this to Datasets in Hacktoberfest Oct 26, 2020
@mikayelh
Copy link
Collaborator Author

mikayelh commented Oct 26, 2020

Hey, @harshitsankhla - I've uploaded the dataset for you to access to our google drive. Let me know if this works for you.

@harshitsankhla
Copy link

Hey, @harshitsankhla - I've uploaded the dataset for you to access to our google drive. Let me know if this works for you.

Are you sure this is the right file ? Its just a 1MB CSV, I thought the dataset is 23GBs

@mikayelh
Copy link
Collaborator Author

Oops, apologies! I've updated the link. If this doesn't work for you, you can go ahead and pick another dataset!

@mikayelh
Copy link
Collaborator Author

Hey @harshitsankhla dropping a note to see if you're still interested in contributing! I would also love to invite you to the launch event of Hub v1.0, DM me here or in our slack community if interested!

@harshitsankhla
Copy link

Hi @mikayelh will finish this week

@mikayelh
Copy link
Collaborator Author

mikayelh commented Dec 2, 2020

Supercool, @harshitsankhla, thanks!

@mynameisvinn
Copy link
Contributor

Closing this feature request due to inactivity and lack of interest. Will revive it if more users request it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
No open projects
Development

No branches or pull requests

5 participants