[CARBONDATA-1867] Add support for task/segment level pruning #1624

Closed


@manishgupta88 manishgupta88 commented Dec 6, 2017

Adds support for task/segment-level pruning. This change computes task-level min/max values, which can then be used to prune at the task or segment level.

  • Any interfaces changed?

  • Any backward compatibility impacted?

  • Document update required?

  • Testing done

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@CarbonDataQA

Build Success with Spark 2.2.0, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/499/

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/1762/

* @return
* @throws IOException
*/
public boolean isScanRequired(String segmentId, FilterResolverIntf filterExp) throws IOException {

This should not be isScanRequired. It should be pruneSegments, which takes the list of segments and the filter expression and returns the pruned list of segments.
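The shape suggested in this comment can be sketched as follows. This is only an illustration under stated assumptions: `SegmentMinMax` and `RangeFilter` are hypothetical stand-ins (they are not CarbonData classes, and `RangeFilter` is not `FilterResolverIntf`), and a single `long` column replaces the real per-column byte arrays. The point is the signature: a list of segments and a filter go in, and the list of segments that may still contain matching rows comes out.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; none of these types exist in CarbonData. */
final class SegmentPruningSketch {

  /** Hypothetical per-segment summary: min/max of one column, merged over all tasks. */
  static final class SegmentMinMax {
    final String segmentId;
    final long min;
    final long max;

    SegmentMinMax(String segmentId, long min, long max) {
      this.segmentId = segmentId;
      this.min = min;
      this.max = max;
    }
  }

  /** Hypothetical stand-in for a resolved filter: column value within [low, high]. */
  static final class RangeFilter {
    final long low;
    final long high;

    RangeFilter(long low, long high) {
      this.low = low;
      this.high = high;
    }

    /** True when the range [min, max] overlaps the filter range. */
    boolean mayMatch(long min, long max) {
      return max >= low && min <= high;
    }
  }

  /** Returns the ids of the segments whose min/max range can satisfy the filter. */
  static List<String> pruneSegments(List<SegmentMinMax> segments, RangeFilter filter) {
    List<String> kept = new ArrayList<>();
    for (SegmentMinMax segment : segments) {
      if (filter.mayMatch(segment.min, segment.max)) {
        kept.add(segment.segmentId);
      }
    }
    return kept;
  }
}
```

Compared with a per-segment boolean check, the caller makes one call and receives the surviving segments directly.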

* @param taskMinMaxRow
* @return
*/
public void addTaskMinMaxRowToUnsafe(DataMapRow taskMinMaxRow, List<Integer> indexesToAccess)

Why is a new method required? Why not use addToUnsafe?

} else {
byte[][] existingMinMaxValues = getMinMaxValue(taskMinMaxRow, ordinal);
// Compare and update min max values
SerializableComparator comparator =

I think you can use UnsafeComparer.compare
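For context, the compare-and-update step under discussion can be sketched in plain Java. This is a minimal illustration, not the PR's code: `TaskMinMaxSketch`, `compareUnsigned`, and `merge` are hypothetical names, and the hand-written loop stands in for whichever comparer the project uses, on the assumption that min/max byte arrays are ordered byte-wise as unsigned values.

```java
/** Illustrative sketch only; names and layout are assumptions, not CarbonData code. */
final class TaskMinMaxSketch {

  /** Unsigned lexicographic comparison of two byte arrays. */
  static int compareUnsigned(byte[] a, byte[] b) {
    int common = Math.min(a.length, b.length);
    for (int i = 0; i < common; i++) {
      int x = a[i] & 0xFF; // mask so that 0x80..0xFF sort above 0x00..0x7F
      int y = b[i] & 0xFF;
      if (x != y) {
        return x - y;
      }
    }
    // All shared bytes are equal: the shorter array sorts first.
    return a.length - b.length;
  }

  /** Widens the task-level min/max in place using one block's per-column min/max. */
  static void merge(byte[][] taskMin, byte[][] taskMax, byte[][] blockMin, byte[][] blockMax) {
    for (int col = 0; col < taskMin.length; col++) {
      if (compareUnsigned(blockMin[col], taskMin[col]) < 0) {
        taskMin[col] = blockMin[col];
      }
      if (compareUnsigned(blockMax[col], taskMax[col]) > 0) {
        taskMax[col] = blockMax[col];
      }
    }
  }
}
```

The mask with `0xFF` matters because Java bytes are signed: without it, a value such as `0x80` would compare as smaller than `0x7F` and the merged range would be too narrow.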

}
}

private void addRowToUnsafeMemoryStore(DataMapRow row) {

I think it is better to set some junk or default values for everything except the min/max in the last row, so that it is easy to read and write.

@ravipesala
Contributor

SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/2145/

…ask level min/max which can be helpful for task/segment level pruning
@manishgupta88 changed the title from "[WIP][CARBONDATA-1867] Add support for task/segment level pruning" to "[CARBONDATA-1867] Add support for task/segment level pruning" on Dec 6, 2017
@CarbonDataQA

Build Success with Spark 2.2.0, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/523/

@manishgupta88
Contributor Author

@ravipesala I have handled the review comments. Kindly review and merge.

@CarbonDataQA

Build Success with Spark 2.2.0, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/524/

@CarbonDataQA

Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/1780/

@ravipesala
Contributor

SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/2155/

@ravipesala
Contributor

SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/2156/

@CarbonDataQA

Build Success with Spark 2.2.0, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/549/

@CarbonDataQA

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/1807/

@ravipesala
Contributor

SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/2179/

@ravipesala
Contributor

LGTM

@asfgit asfgit closed this in 2bad144 Dec 7, 2017
@manishgupta88 manishgupta88 deleted the add_task_level_min_max branch December 12, 2017 05:45
jatin9896 pushed a commit to jatin9896/incubator-carbondata that referenced this pull request Jan 5, 2018
Added support for task/segment level pruning. Added code to compute task level min/max which can be helpful for task/segment level pruning

This closes apache#1624
anubhav100 pushed a commit to anubhav100/incubator-carbondata that referenced this pull request Jun 22, 2018
Added support for task/segment level pruning. Added code to compute task level min/max which can be helpful for task/segment level pruning

This closes apache#1624