-
Notifications
You must be signed in to change notification settings - Fork 160
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
希望增加group_by之后的统计函数? #49
Comments
I can not understand what you mean。 |
i mean, in addition to sum(), count(), could bigflow support mean()/variance() and other popular statistical function for PCollection ? |
Actually, you can use:
to implement then, you can use them in
At the same time, if you want to use it to a global pcollection, you can just use
or just call it directly:
Because it's easy to implement these functions, so we don't regard them as built-in methods. If you find it difficult to write these functions, you can always use
And you can implement a method to get both sum and mean, and use them in
|
I think there should be a module to provide available or useful functions. |
在对key group_by之后,希望可以方便做求均值,求方差,排序再遍历这样的操作;
希望可以提供类似这样的内置函数
The text was updated successfully, but these errors were encountered: