-
-
Notifications
You must be signed in to change notification settings - Fork 91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Simplify partition structure #728
Conversation
3a481df
to
96de62f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is looking fine so far. I'm excited for this change, but as you said in-person the export/count side needs to be implemented first to see if this all works out or if you forgot about some component that needs to be touched as well.
I can't see any besides the things we talked about yesterday. So far things are looking fine, but there's still lots of work to do.
8906a41
to
6c6d6cf
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall, the architecture of the change set feels right. Unfortunately there are too many difference concerns and optimizations in one big chunk that it's hard to review accurately. Splitting this up in more digestible pieces would be the next step.
a698e19
to
fb587c3
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is getting close. At this point I have mainly refactoring comments.
00e29b6
to
5e549b1
Compare
The recent round of comments have been addressed or replied to. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Almost there!
Prior to this commit, a partition was a container for a set of tables that each directly corresponded to one layout as imported into VAST. This commit modifies the index to decompose table slices into columns at the boundary and promotes the indexer actors responsible for each such column into a member of the partition. The effect is twofold: - An updated schema will not result in a duplication of all columns that exist in both versions of a type with the same name, but only the ones that are actually different. - The filesystem structure is consolidated in the same way, which results in slightly less reads when a query hits a persisted partition, and it is easier to inspect the the disk space usage of the individual fields of the input.
Co-Authored-By: Matthias Vallentin <matthias@tenzir.com>
Co-Authored-By: Matthias Vallentin <matthias@tenzir.com>
This needs to be cleaned and rebased. Do not review yet.
TODO: