Hadoop reindexing task doesn't remove dimensions #5095
Comments
On a quick look, the task spec ( https://pastebin.com/raw/cxFBYn7q ) looks fine to me. You don't need to include the same list in ingestionSpec.dimensions; however, it should still have worked, since the list is the same. How did you verify that the new segments still had the unwanted dimensions that you did not mention in your task spec?
@himanshug The indexing job succeeded. I verified it as follows:
In the indexing logs I can see:
whitelisted dimensions only:
|
In the |
Ah, I revisited those segments and it looks like only the last 3 of the 26 segments did not get updated. OK, thank you @himanshug, now it's going to be easy to investigate the rest. Look at the timestamps of those last 3 segments:
|
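For anyone doing the same check across many segments, here is a rough sketch of how the comparison could be automated. It assumes you have already pulled each segment's dimension list into a dict (for example from the segment metadata), and that the list is stored as a comma-separated string; both of those are assumptions on my side, and the segment ids and dimension names are made up.

```python
from typing import Dict, Iterable, List, Set


def parse_dimensions(raw: str) -> List[str]:
    """Split a comma-separated dimension string into a clean list."""
    return [d.strip() for d in raw.split(",") if d.strip()]


def find_unwanted_dimensions(
    segments: Dict[str, str], whitelist: Iterable[str]
) -> Dict[str, Set[str]]:
    """Map each segment id to the dimensions it has outside the whitelist.

    Segments that contain only whitelisted dimensions are left out of the result.
    """
    allowed = set(whitelist)
    offenders: Dict[str, Set[str]] = {}
    for segment_id, raw_dims in segments.items():
        extra = set(parse_dimensions(raw_dims)) - allowed
        if extra:
            offenders[segment_id] = extra
    return offenders


if __name__ == "__main__":
    # Hypothetical segment ids and dimensions, for illustration only.
    segments = {
        "events_2017-01-01": "country,device",
        "events_2017-01-02": "country,device,debug_flag",
    }
    print(find_unwanted_dimensions(segments, ["country", "device"]))
```

Any segment that shows up in the result was either not reindexed or was reindexed with a different dimension list, which narrows down where to look.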
Hey,
On Druid 0.10.1 I'm having trouble cleaning up segments from unwanted dimensions using a Hadoop indexing task. Discussed here: https://groups.google.com/forum/#!topic/druid-user/aBbuMYNRID8

I tried both the schemaless approach with dimensionExclusions ( https://pastebin.com/raw/DwXw8dmE ) and an explicit dimensions list ( https://pastebin.com/raw/cxFBYn7q ). The segments get reindexed, but they still contain all those unwanted dimensions.

At first I thought that the dimensions were not being picked up from the ParseSpec, so I also tried to specify them in IoConfig.InputSpec.IngestionSpec.dimension too, but that didn't help.
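To make the two places concrete, here is a small sketch that builds only the relevant fragment of a reindexing task spec. The field names follow my understanding of the Druid Hadoop task format with a dataSource inputSpec; this is not the spec from the pastebin links, and a real task also needs timestampSpec, metricsSpec, granularitySpec and tuningConfig, which are left out. The function name, datasource, interval and dimension names are hypothetical.

```python
import json
from typing import Any, Dict, List


def build_reindex_fragment(
    data_source: str,
    intervals: List[str],
    dimensions: List[str],
    repeat_in_ingestion_spec: bool = False,
) -> Dict[str, Any]:
    """Build the dimension-related parts of a Hadoop reindexing task spec.

    Incomplete on purpose: only the sections that carry the dimension
    whitelist are included.
    """
    ingestion_spec: Dict[str, Any] = {
        "dataSource": data_source,
        "intervals": intervals,
    }
    if repeat_in_ingestion_spec:
        # Optional: the same whitelist repeated for the segment reader.
        ingestion_spec["dimensions"] = list(dimensions)

    return {
        "type": "index_hadoop",
        "spec": {
            "dataSchema": {
                "dataSource": data_source,
                "parser": {
                    "type": "hadoopyString",
                    "parseSpec": {
                        "format": "json",
                        # Whitelist of dimensions to keep after reindexing.
                        "dimensionsSpec": {"dimensions": list(dimensions)},
                    },
                },
            },
            "ioConfig": {
                "type": "hadoop",
                "inputSpec": {
                    "type": "dataSource",
                    "ingestionSpec": ingestion_spec,
                },
            },
        },
    }


if __name__ == "__main__":
    fragment = build_reindex_fragment(
        "events", ["2017-01-01/2017-02-01"], ["country", "device"]
    )
    print(json.dumps(fragment, indent=2))
```

The whitelist lives in dimensionsSpec; repeating it under ingestionSpec is optional and is only switched on by the flag.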