-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Community data
contributions API
#33
Comments
What you're doing here with AgML is really awesome, and will make using image datasets for testing/developing so much easier! I think this was briefly mentioned some time ago, but it would be great to form a connection between Weed-AI and AgML for the weeds image side of things. Weed-AI now supports annotation through CVAT, so unannotated data can be annotated publicly before being uploaded to the platform. We've also worked on establishing agricultural metadata reporting standards for weeds called AgContext, so each dataset has information on where/how it was collected. There is also version control and dataset editing functions too. One limitation is that it currently is only for weeds, not all the various form of image data used in agriculture currently. Helping make the API is a little beyond my skillset, but if it's something of interest I'd be happy to help some other way. At least it might help make the connection between annotation/upload > standardised metadata > use/editing > download. |
@amogh7joshi / @masonearles where is the s3 bucket located where the data goes now? I can for sure build and API no sweat, but I need a target :) Also for you 2 plus @geezacoleman / other users (To be clear your answers are going to depend on your use case and data sets, so just answer for what you know!)
Excited to get started! |
To be clear are we just looking for an automated API to do all of this stuff? Does the data just live here in the repo? https://github.com/Project-AgML/AgML/blob/main/CONTRIBUTING.md |
Perhaps this should be our target landing spot? https://www.tensorflow.org/datasets |
@KeynesYouDigIt As mentioned offline, the data currently lives in a publicly readable S3 bucket. We manage admin for write. It would be great to create a pipeline for AgML users to contribute data but with a gate for an admin to run a QA check before uploading to the S3 bucket. |
We're opening up this issue regarding how to enable easy, yet high quality, data contributions to AgML. This was raised initially in Issue 15. If you are interested in contributing to this discussion and code development, let's have this conversation below.
The text was updated successfully, but these errors were encountered: