Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow forcing Delta Lake analyze to ignore previous analysis time #15968

Closed
alexjo2144 opened this issue Feb 3, 2023 · 0 comments · Fixed by #16634
Closed

Allow forcing Delta Lake analyze to ignore previous analysis time #15968

alexjo2144 opened this issue Feb 3, 2023 · 0 comments · Fixed by #16634
Assignees
Labels
enhancement New feature or request

Comments

@alexjo2144
Copy link
Member

alexjo2144 commented Feb 3, 2023

In some situations a Delta table may have stats collected which are not complete, for examble

  • CTAS a table, creates statistics (soon)
  • INSERT data with statistics disabled
  • INSERT data with statistics enabled

In this case to have complete stats a user would have to run the dropStats procedure and then re-analyze the data because the connector will ignore files older than the last analysis time. The procedure should have an argument to ignore it and analyze all files.

@findepi @findinpath @pajaks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Development

Successfully merging a pull request may close this issue.

2 participants