-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Write Delta Lake "operationMetrics" Transaction Log Field #12005
Comments
what should go into this field then? cc @vkorukanti |
cc @claudiusli also cc @alexjo2144 @ilfrin |
@vkorukanti any new thoughts on this w.r.t. https://databricks.com/blog/2022/06/30/open-sourcing-all-of-delta-lake.html ? |
Apologies for not getting back on time. The Delta-on-Spark opensource project already has metrics defined here written as part of the commit. Regarding whether they should be part of the protocol: ideally they should be, we haven't documented them yet. These are evolving frequently based on the need. Also these metrics are currently a bag of json fields, so any implementation expected to handle missing fields or extra fields. |
The operation metrics are also listed in the https://trino.io/docs/current/connector/delta-lake.html#history-table trino/plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/DeltaLakeHistoryTable.java Line 69 in 160400a
|
Delta Lake has a commit field called operationMetrics that had some statistics on the rows deleted.
It's not in the protocol definition but it could be useful to include.
See DeltaLakeMetadata
The text was updated successfully, but these errors were encountered: