release deltalake 0.4 #319

Closed
wants to merge 74 commits into from

houqp
Member

@houqp houqp commented Jul 18, 2021

Description

Get a new release out before we upgrade arrow/parquet to 5.x

QP Hou and others added 30 commits May 7, 2021 20:49
This isn't related to #222 but I saw the missing integration test coverage
…r files in a partition

This commit also adds an integration test, but I have not yet manually tested
this with Azure or S3 storage providers

Fixes #222
… and losing data

This lays the groundwork for exposing file stats to consumers of the public Rust API

See #45
This data set contains file statistics. Apparently none of the golden data sets,
nor our pre-existing data sets have per-file statistics in them, making them
very unhelpful for tests :)

Incorporated with permission from @dennyglee
Co-authored-by: R. Tyler Croy <rtyler@brokenco.de>
I deleted these resources in my Azure account 🤷
…he "add" action

This API might receive some changes to accommodate users who need to provide
`txn` actions along with the add_file
…actions

This should support use-cases where the caller needs to put a txn or two into
the log
…level writer interface

I envision this writers module encapsulating a few different flavors of
high-level writers. The syntax/API surface common to all the high-level
writers should be largely identical, but I'm not ready to stabilize around
what's here.

I also am on the fence on how this should handle (or not handle) `txn` actions
…, and finish up BufferedJSONWriter

There are some optimizations that come to mind for this work, but at this point
I think it's ready for simple high-level JSON writer usage
This commit also addresses a number of other code review comments
xianwill and others added 22 commits June 15, 2021 15:43
* Added .to_pandas to deltalake python

* Fixed linting issue

* Changed pandas import. Check pandas only during type checking

* Ignore pd.DataFrame during type check
also extend typechecks to python tests
…295)

* Make dynamodb lease_duration optional to support non-expirable lock
* Add do_not_timeout_on_non_expirable_locks and unit tests
* Refactor has_timed_out options check
* Remove unneeded do_not_timeout_on_non_expirable_locks option
* Add delta-checkpoint release workflow
…ics for Datafusion `TableProvider` (#308)

* Expose delta statistics

* Start using statistics from delta
* Expose, calculate number of records in table provider
* Add TODO
@houqp
Member Author

houqp commented Jul 18, 2021

anyone remember why we named the release branch 3.x?

@houqp houqp closed this Jul 18, 2021
@houqp houqp deleted the qp_release branch July 18, 2021 07:00
@houqp
Member Author

houqp commented Jul 18, 2021

OK, I think this was a typo; it was supposed to be 0.3.x :(
