-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Data] Add performant way to read large tfrecord datasets #42277
[Data] Add performant way to read large tfrecord datasets #42277
Commits on Jan 24, 2024
-
feat: add performant way to read large tfrecord datasets
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for af7b677 - Browse repository at this point
Copy the full SHA af7b677View commit details -
add tfx-bsl as a test dependency
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for f25fa2e - Browse repository at this point
Copy the full SHA f25fa2eView commit details -
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 14ed874 - Browse repository at this point
Copy the full SHA 14ed874View commit details -
properly enable/disable fast read on tests
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 3b8bf91 - Browse repository at this point
Copy the full SHA 3b8bf91View commit details -
resolve rabsolute path from relative
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for b303cac - Browse repository at this point
Copy the full SHA b303cacView commit details -
add tensorflow-io for s3 fs impl
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for ddfca72 - Browse repository at this point
Copy the full SHA ddfca72View commit details -
try adding tfx-bsl, cython in data-test-requirements
Signed-off-by: Scott Lee <sjl@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for c3beb21 - Browse repository at this point
Copy the full SHA c3beb21View commit details
Commits on Jan 25, 2024
-
Signed-off-by: Scott Lee <sjl@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 4ac5221 - Browse repository at this point
Copy the full SHA 4ac5221View commit details -
Apply suggestions from code review
Co-authored-by: Scott Lee <scottjlee@users.noreply.github.com> Signed-off-by: Martin <martinbomio@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 552c2e0 - Browse repository at this point
Copy the full SHA 552c2e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a007c6 - Browse repository at this point
Copy the full SHA 4a007c6View commit details -
Merge branch 'martinbomio/fast-tfrecord-read' of https://github.com/m…
…artinbomio/ray into martinbomio/fast-tfrecord-read
Configuration menu - View commit details
-
Copy full SHA for 3c20737 - Browse repository at this point
Copy the full SHA 3c20737View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf061db - Browse repository at this point
Copy the full SHA bf061dbView commit details -
Signed-off-by: Scott Lee <sjl@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 3d7bd33 - Browse repository at this point
Copy the full SHA 3d7bd33View commit details -
Configuration menu - View commit details
-
Copy full SHA for b8519a1 - Browse repository at this point
Copy the full SHA b8519a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for b6982a3 - Browse repository at this point
Copy the full SHA b6982a3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7934adc - Browse repository at this point
Copy the full SHA 7934adcView commit details
Commits on Jan 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 386be26 - Browse repository at this point
Copy the full SHA 386be26View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1d83dce - Browse repository at this point
Copy the full SHA 1d83dceView commit details -
Signed-off-by: Scott Lee <sjl@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 2c49d51 - Browse repository at this point
Copy the full SHA 2c49d51View commit details -
Configuration menu - View commit details
-
Copy full SHA for 953af46 - Browse repository at this point
Copy the full SHA 953af46View commit details -
Configuration menu - View commit details
-
Copy full SHA for c373724 - Browse repository at this point
Copy the full SHA c373724View commit details
Commits on Jan 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for e19c1fc - Browse repository at this point
Copy the full SHA e19c1fcView commit details
Commits on Jan 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for fb6b290 - Browse repository at this point
Copy the full SHA fb6b290View commit details -
Merge branch 'master' into martinbomio/fast-tfrecord-read
Signed-off-by: Scott Lee <sjl@anyscale.com>
Configuration menu - View commit details
-
Copy full SHA for 7d7a08f - Browse repository at this point
Copy the full SHA 7d7a08fView commit details
Commits on Feb 5, 2024
-
rewrite unwrap single value function to use pyarrow
This way we can avoid issues with tensor array conversions as well as cast from large_list to list to avoid issues with to_tf function Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for ca27c49 - Browse repository at this point
Copy the full SHA ca27c49View commit details
Commits on Feb 6, 2024
-
Merge branch 'master' into martinbomio/fast-tfrecord-read
Signed-off-by: Martin <martinbomio@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for aac5ec6 - Browse repository at this point
Copy the full SHA aac5ec6View commit details
Commits on Feb 7, 2024
-
Merge branch 'martinbomio/fast-tfrecord-read' of https://github.com/m…
…artinbomio/ray into martinbomio/fast-tfrecord-read
Configuration menu - View commit details
-
Copy full SHA for 7408c10 - Browse repository at this point
Copy the full SHA 7408c10View commit details
Commits on Feb 13, 2024
-
cast large_list to list always on fast read
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 030556c - Browse repository at this point
Copy the full SHA 030556cView commit details -
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 681f753 - Browse repository at this point
Copy the full SHA 681f753View commit details
Commits on Feb 14, 2024
-
Merge branch 'master' into martinbomio/fast-tfrecord-read
Signed-off-by: Martin <martinbomio@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for bca5bff - Browse repository at this point
Copy the full SHA bca5bffView commit details
Commits on Feb 20, 2024
-
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 8cf1c2d - Browse repository at this point
Copy the full SHA 8cf1c2dView commit details
Commits on Feb 26, 2024
-
rename fast_* variables to tfx_
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 9aa260b - Browse repository at this point
Copy the full SHA 9aa260bView commit details -
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for befc187 - Browse repository at this point
Copy the full SHA befc187View commit details -
add flag in data context to disable using tfx read
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 413c1f0 - Browse repository at this point
Copy the full SHA 413c1f0View commit details
Commits on Feb 27, 2024
-
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 1e2c627 - Browse repository at this point
Copy the full SHA 1e2c627View commit details
Commits on Feb 28, 2024
-
Signed-off-by: Martin Bomio <martinbomio@spotify.com>
Configuration menu - View commit details
-
Copy full SHA for 9894178 - Browse repository at this point
Copy the full SHA 9894178View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf06a37 - Browse repository at this point
Copy the full SHA bf06a37View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf64415 - Browse repository at this point
Copy the full SHA bf64415View commit details -
Configuration menu - View commit details
-
Copy full SHA for 663c39c - Browse repository at this point
Copy the full SHA 663c39cView commit details
Commits on Feb 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 550392e - Browse repository at this point
Copy the full SHA 550392eView commit details