Skip to content

Conversation

PJEstrada
Copy link
Contributor

This branch will add 3 main features:

  • DatasetIterator class: will allow all dataset objects to be iterable so we can directly get all the instances on each file in a dataset with a simple forloop.
  • pytorch dataset class: builds on to of the base iterator and create a pytorch dataset object.
  • tensorflow dataset class: creates a tf dataset object.

This version will limit to only bbox and polygon instance types. We will also add the ability to slice a dataset with the query engine an iterate it directly.

@PJEstrada PJEstrada self-assigned this Aug 9, 2021
@PJEstrada PJEstrada added enhancement New feature or request pending_work labels Aug 9, 2021
@PJEstrada PJEstrada removed the request for review from anthony-chaudhary August 25, 2021 15:21
@PJEstrada
Copy link
Contributor Author

This branch is ready for review: some things I want to discuss related to the to_tensorflow():

I've found lots of ways that people ingest data to model so still not 100% sure on what would be a good standard to follow. For now I'm returning a tf.data.example object with the file's instances and image data. Please let me know if you think this is a right direction.

Other from that, I think this branch should be ready:

  • Added Sliced dataset class. For queries in dataset
  • Added the to_pytorch() function
  • Added the to_tensorflow() function
  • Addded the slice() function
  • Added the explore() function

@anthony-chaudhary
Copy link
Member

anthony-chaudhary commented Aug 25, 2021 via email

@PJEstrada PJEstrada merged commit f6b9941 into main Sep 1, 2021
@PJEstrada PJEstrada deleted the queries-and-export-to-ml-libaries branch September 1, 2021 15:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request ready_to_merge
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants