Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add comments for memory-related reduce operations. #1006

Merged
merged 2 commits into from
Aug 8, 2014

Conversation

reconditesea
Copy link
Contributor

Resolve #1000.
I found this documentation useful myself. Since when I was choosing sorting approaches, I would like to know which uses a in-memory heap and which uses external sorting.

@johnynek
Copy link
Collaborator

johnynek commented Aug 8, 2014

merge when green.

@@ -84,6 +84,11 @@ object Grouped {
}
}

/**
* All sorting methods defined in this trait are using
* Hadoop's reducer-side external sorting. i.e. it won't
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

aka 'secondary sorting' in Hadoop. Could say 'All sorting methods defined here trigger Hadoop secondary sort on value. i. e. won't materialize all values of each key in memory on the reducer'.

I should add a description to recent 'qna' expanding on answer that Oscar gave about using sortWithTake().

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, this makes the comments further clearer.

@rangadi
Copy link

rangadi commented Aug 8, 2014

Good patch btw. I felt the same : there are multiple places where we can add or expand scaladoc. You actually did it :)

@rangadi
Copy link

rangadi commented Aug 8, 2014

👍

jcoveney added a commit that referenced this pull request Aug 8, 2014
Add comments for memory-related reduce operations.
@jcoveney jcoveney merged commit a96fd73 into develop Aug 8, 2014
@jcoveney jcoveney deleted the klin_more_comments branch August 8, 2014 17:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

More comments for sorting algorithm in Grouped?
4 participants