Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make compaction work with column slicing, dynamic schema and sparse data #1505

Open
willdealtry opened this issue Apr 18, 2024 · 0 comments
Open
Labels
enhancement New feature or request

Comments

@willdealtry
Copy link
Collaborator

At the moment compaction is not safe to perform on quite a lot of data - essentially it only works for static schema. Ideally it should work more like downsampling in the processing pipeline, in that it needs to multiply reference segments where the new segment boundary crosses the existing boundary.

Long term it would be desirable to be able to physically compact existing versions in a way that left them logically the same, however we would need to have great confidence in its robustness before taking the step of altering existing data, so it needs a really good test suite involving some declarative testing to flush out the edge cases.

@willdealtry willdealtry added the enhancement New feature or request label Apr 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant