[2.0] writing/reading fixed-shape arrays to chunks #842

Merged
verbose-void merged 99 commits into release/2.0 from feature/2.0/chunk-engine on May 13, 2021

verbose-void commented May 7, 2021

support chunked writing (not appending) of np.arrays to storage providers, for arrays that are:

  • batched or unbatched
  • fixed shape (all samples have the same shape)
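
For readers unfamiliar with the idea, here is a minimal sketch of what chunked writing and reading of a fixed-shape array can look like. It is not the code from this branch: `write_array`, `read_array`, the `chunk_size` default, the key layout, and the use of a plain dict as a stand-in storage provider are all assumptions made for illustration.

```python
import numpy as np


def write_array(array, key, storage, chunk_size=4096, batched=False):
    """Split a fixed-shape array's bytes into chunks stored under `key`.

    `storage` is any dict-like object (hypothetical stand-in for a
    storage provider). Returns the metadata needed to read it back.
    """
    array = np.ascontiguousarray(array)
    if not batched:
        # Treat a single sample as a batch of one.
        array = array[np.newaxis, ...]

    data = array.tobytes()
    chunk_names = []
    for i, start in enumerate(range(0, len(data), chunk_size)):
        name = f"{key}/chunks/c{i}"  # hypothetical key layout
        storage[name] = data[start:start + chunk_size]
        chunk_names.append(name)

    return {
        "dtype": array.dtype.str,
        "sample_shape": array.shape[1:],
        "length": array.shape[0],
        "chunks": chunk_names,
    }


def read_array(meta, storage):
    """Join the chunks listed in `meta` and rebuild the batched array."""
    data = b"".join(storage[name] for name in meta["chunks"])
    shape = (meta["length"], *meta["sample_shape"])
    return np.frombuffer(data, dtype=meta["dtype"]).reshape(shape)


storage = {}
batch = np.arange(24, dtype="int32").reshape(2, 3, 4)
meta = write_array(batch, "tensor", storage, chunk_size=16, batched=True)
restored = read_array(meta, storage)
```

Because every sample has the same shape, the sample shape only needs to be recorded once, and a sample's byte range can be computed from its index. That is what makes the fixed-shape case simpler than the variable-shape one.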

also adds:

  • in .circleci/config.yml, run pytest and pytest-benchmark separately (with the --benchmark-skip and --benchmark-only flags, respectively)

things this branch does not do:

  • appending (writing arrays to a key that already had arrays written to it)
  • caching
  • index map chunking
  • compression

verbose-void added the enhancement (New feature or request) label May 7, 2021
verbose-void self-assigned this May 7, 2021

todo bot commented May 7, 2021

change storage type to StorageProvider

https://github.com/activeloopai/Hub/blob/8d95e4aca5fea3465e3fcd5af334276450c9394a/hub/core/chunk_engine/read.py#L9-L14


This comment was generated by todo based on a TODO comment in 8d95e4a in #842. cc @activeloopai.

todo bot commented May 7, 2021

read with slice

https://github.com/activeloopai/Hub/blob/8d95e4aca5fea3465e3fcd5af334276450c9394a/hub/core/chunk_engine/read.py#L10-L15


This comment was generated by todo based on a TODO comment in 8d95e4a in #842. cc @activeloopai.

todo bot commented May 7, 2021

don't use pickle

https://github.com/activeloopai/Hub/blob/8d95e4aca5fea3465e3fcd5af334276450c9394a/hub/core/chunk_engine/read.py#L21-L26


This comment was generated by todo based on a TODO comment in 8d95e4a in #842. cc @activeloopai.
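
The linked code is not reproduced in this thread, so what was being pickled is not known. Assuming it is small metadata (dtype, shape, chunk names), one common alternative is JSON, which is readable from other languages and does not execute code when loaded. The sketch below uses hypothetical helpers `encode_meta` and `decode_meta` and a made-up metadata layout.

```python
import json


def encode_meta(meta):
    """Serialize a metadata dict to UTF-8 JSON bytes (instead of pickle)."""
    return json.dumps(meta, sort_keys=True).encode("utf-8")


def decode_meta(raw):
    """Parse JSON bytes back into a metadata dict."""
    meta = json.loads(raw.decode("utf-8"))
    # JSON has no tuple type, so the shape comes back as a list.
    meta["sample_shape"] = tuple(meta["sample_shape"])
    return meta


# Hypothetical metadata layout, for illustration only.
example_meta = {"dtype": "<i4", "sample_shape": (3, 4), "length": 2}
raw_meta = encode_meta(example_meta)
```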

todo bot commented May 7, 2021

don't use pickle

https://github.com/activeloopai/Hub/blob/8d95e4aca5fea3465e3fcd5af334276450c9394a/hub/core/chunk_engine/read.py#L25-L30


This comment was generated by todo based on a TODO comment in 8d95e4a in #842. cc @activeloopai.

todo bot commented May 7, 2021

decode from array instead of dictionary

https://github.com/activeloopai/Hub/blob/8d95e4aca5fea3465e3fcd5af334276450c9394a/hub/core/chunk_engine/read.py#L37-L42


This comment was generated by todo based on a TODO comment in 8d95e4a in #842. cc @activeloopai.

Review thread on .circleci/config.yml (outdated, resolved)

benchislett left a comment

The usual nitpicks

verbose-void merged commit 80074a4 into release/2.0 May 13, 2021
verbose-void deleted the feature/2.0/chunk-engine branch May 13, 2021 04:18
3 participants