【Hackathon 4 No.19】Add polygamma API to Paddle #53791

PommesPeter · 2023-05-14T05:48:49Z

PR types

New features

PR changes

APIs

Description

rfc doc here: PaddlePaddle/community#472
updated rfc doc here: PaddlePaddle/community#542
polygamma doc here: PaddlePaddle/docs#5913

paddle-bot · 2023-05-14T05:48:54Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-bot · 2023-05-14T05:48:57Z

❌ The PR is not created using PR's template. You can refer to this Demo.
Please use PR's template, it helps save our maintainers' time so that more developers get helped.

PommesPeter · 2023-05-22T12:43:24Z

https://github.com/PaddlePaddle/Paddle/blob/91a77fe079ad334e75a122168a8338fa3acd0a97/python/paddle/tensor/math.py#L5684-L5686
我想反馈一个 bug，出自于 paddle.digamma 的，因为本 api 中， n=0 的情况是可以直接调用 digamma api，但是 digamma 在 x=0 处的返回值和 scipy 的 special.digamma 并没有对齐，torch 也同理。

测试的结果如下：

scipy
torch
paddle

这是否可认为是一个 bug，根据定义 x=0 处的值应为 -inf，但 paddle 返回了 nan。 @luotao1 @zoooo0820 @Ligoml

zoooo0820 · 2023-05-23T07:32:56Z

这是否可认为是一个 bug，根据定义 x=0 处的值应为 -inf，但 paddle 返回了 nan。

@PommesPeter 你好，这里的确是有行为上的差异，torch从1.8版本开始将x=0点的输出从nan改为了-inf。想确认下这个行为差异会对polygamma的实现带来多少影响？除了x=0处输出的结果外，还会有其他的差异吗。

zoooo0820 · 2023-05-23T07:37:09Z

@PommesPeter 另外，从目前的实现看，新增了相关OP，这与RFC方案中基于现有API组合实现的方式有差异。出于飞桨多硬件适配等方面的考虑，是期望尽量减少基础算子的数量的。请问这里新增算子是否是因为遇到了RFC方案不可解决的问题，或是有其他特殊的考虑吗？

PommesPeter · 2023-05-23T07:56:02Z

这是否可认为是一个 bug，根据定义 x=0 处的值应为 -inf，但 paddle 返回了 nan。

@PommesPeter 你好，这里的确是有行为上的差异，torch从1.8版本开始将x=0点的输出从nan改为了-inf。想确认下这个行为差异会对polygamma的实现带来多少影响？除了x=0处输出的结果外，还会有其他的差异吗。

目前只发现 x=0 处的结果有问题，其他的不会有差异。

PommesPeter · 2023-05-23T08:06:20Z

@PommesPeter 另外，从目前的实现看，新增了相关OP，这与RFC方案中基于现有API组合实现的方式有差异。出于飞桨多硬件适配等方面的考虑，是期望尽量减少基础算子的数量的。请问这里新增算子是否是因为遇到了RFC方案不可解决的问题，或是有其他特殊的考虑吗？

是看到该算子初期的任务要求是需要使用 C++ 实现，参考 torch 实现后，发现使用 zeta 函数能够减少递归带来的运算量，rfc 文档里面预期是采用递归的方式，但该操作性能影响可能会较大，考虑算子计算性能故参考 torch 的实现。

zoooo0820 · 2023-05-23T09:45:52Z

是看到该算子初期的任务要求是需要使用 C++ 实现，参考 torch 实现后，发现使用 zeta 函数能够减少递归带来的运算量，rfc 文档里面预期是采用递归的方式，但该操作性能影响可能会较大，考虑算子计算性能故参考 torch 的实现。

辛苦也对应在RFC文档中，提交PR修改下OP正反向相关计算逻辑的介绍吧

PommesPeter · 2023-05-23T13:10:05Z

是看到该算子初期的任务要求是需要使用 C++ 实现，参考 torch 实现后，发现使用 zeta 函数能够减少递归带来的运算量，rfc 文档里面预期是采用递归的方式，但该操作性能影响可能会较大，考虑算子计算性能故参考 torch 的实现。

辛苦也对应在RFC文档中，提交PR修改下OP正反向相关计算逻辑的介绍吧

好的，已更新 PaddlePaddle/community#542

PommesPeter · 2023-05-24T07:55:45Z

rfc 文档已更新，麻烦 review 一下 @zoooo0820

zoooo0820 · 2023-05-24T09:52:46Z

目前只发现 x=0 处的结果有问题，其他的不会有差异。

@PommesPeter 你好，关于paddle.digamma在0点的行为问题，初步评估了一下，个人认为这里不需要专门改动digamma的行为，主要的理由是：

从数学的角度，0及负整数点，都属于第二类间断点，本身未定义函数值，且间断点两边极限不存在。所以个人认为，NaN是符合相关数学意义的，不能简单定义为Bug。
scipy/pytorch在1所述的场景上，本身行为也有相关的演化过程。目前区分了0和-0，分别返回-inf / inf，这表示对应的极限值。根据其演化过程中产生的讨论来看，这个改动本身也是存在一定的争议，因为看起来都是合理的。
从影响面看，仅一个特殊值点存在一些差异，且这个值实际不应是一个合法输入。

PommesPeter · 2023-05-28T11:29:42Z

目前只发现 x=0 处的结果有问题，其他的不会有差异。

@PommesPeter 你好，关于paddle.digamma在0点的行为问题，初步评估了一下，个人认为这里不需要专门改动digamma的行为，主要的理由是：

从数学的角度，0及负整数点，都属于第二类间断点，本身未定义函数值，且间断点两边极限不存在。所以个人认为，NaN是符合相关数学意义的，不能简单定义为Bug。

scipy/pytorch在1所述的场景上，本身行为也有相关的演化过程。目前区分了0和-0，分别返回-inf / inf，这表示对应的极限值。根据其演化过程中产生的讨论来看，这个改动本身也是存在一定的争议，因为看起来都是合理的。

从影响面看，仅一个特殊值点存在一些差异，且这个值实际不应是一个合法输入。

那目前的解决方案是继续保持该点的取值为 nan，剔除 zero case 的情况么？因为这个实现和目前 scipy 和 pytorch 不统一。目前 CI 仅存在该点的误差。

通过调研情况来看，x=0 是一个没有定义值的，那是否应该对 x=0 的情况做异常值处理，不允许 x=0 作为输入。

zoooo0820 · 2023-05-29T03:10:50Z

目前只发现 x=0 处的结果有问题，其他的不会有差异。

@PommesPeter 你好，关于paddle.digamma在0点的行为问题，初步评估了一下，个人认为这里不需要专门改动digamma的行为，主要的理由是：

从数学的角度，0及负整数点，都属于第二类间断点，本身未定义函数值，且间断点两边极限不存在。所以个人认为，NaN是符合相关数学意义的，不能简单定义为Bug。

scipy/pytorch在1所述的场景上，本身行为也有相关的演化过程。目前区分了0和-0，分别返回-inf / inf，这表示对应的极限值。根据其演化过程中产生的讨论来看，这个改动本身也是存在一定的争议，因为看起来都是合理的。

从影响面看，仅一个特殊值点存在一些差异，且这个值实际不应是一个合法输入。

那目前的解决方案是继续保持该点的取值为 nan，剔除 zero case 的情况么？因为这个实现和目前 scipy 和 pytorch 不统一。目前 CI 仅存在该点的误差。

通过调研情况来看，x=0 是一个没有定义值的，那是否应该对 x=0 的情况做异常值处理，不允许 x=0 作为输入。

当前保持现状即可，nan本身也有异常值的意义，n=0时对所有x直接复用digamma是可行的吧。

PommesPeter · 2023-05-29T03:12:36Z

目前只发现 x=0 处的结果有问题，其他的不会有差异。

@PommesPeter 你好，关于paddle.digamma在0点的行为问题，初步评估了一下，个人认为这里不需要专门改动digamma的行为，主要的理由是：

从数学的角度，0及负整数点，都属于第二类间断点，本身未定义函数值，且间断点两边极限不存在。所以个人认为，NaN是符合相关数学意义的，不能简单定义为Bug。

scipy/pytorch在1所述的场景上，本身行为也有相关的演化过程。目前区分了0和-0，分别返回-inf / inf，这表示对应的极限值。根据其演化过程中产生的讨论来看，这个改动本身也是存在一定的争议，因为看起来都是合理的。

从影响面看，仅一个特殊值点存在一些差异，且这个值实际不应是一个合法输入。

那目前的解决方案是继续保持该点的取值为 nan，剔除 zero case 的情况么？因为这个实现和目前 scipy 和 pytorch 不统一。目前 CI 仅存在该点的误差。
通过调研情况来看，x=0 是一个没有定义值的，那是否应该对 x=0 的情况做异常值处理，不允许 x=0 作为输入。

当前保持现状即可，nan本身也有异常值的意义，n=0时对所有x直接复用digamma是可行的吧。

对的，是可以直接复用 digamma，那我参考 digamma 的单测让 CI 通过。

zoooo0820 · 2023-05-29T03:20:23Z

那我参考 digamma 的单测让 CI 通过。

可以的

zoooo0820

CI-Approval 有一条关于检测到使用了std::cout / print的报告，辛苦再确认下是否是误报呢。

zoooo0820 · 2023-05-31T02:51:56Z

paddle/phi/kernels/cpu/polygamma_kernel.cc

+
+}  // namespace phi
+
+PD_REGISTER_KERNEL(


这里是否可以扩展更多dtype呢，如fp16等。kernel注册dtype时，尽量把目前理论上应当支持的，同时Paddle框架机制上也支持的dtype都包含进来，否则在某些特定场景会有问题。

在RFC设计阶段中，只支持fp32/fp64的原因主要是此前的方案是在digamma上进行，而digamma kernel支持的有限。这个后续也会通过其他专项任务去逐步扩展

zoooo0820 · 2023-05-31T02:53:16Z

python/paddle/tensor/math.py

+            return _C_ops.polygamma(x, n)
+        else:
+            check_variable_and_dtype(
+                x, "x", ["float32", "float64"], "polygamma"


kernel数据类型扩展后，这里可以相应放宽数据类型的支持

paddle/phi/kernels/impl/polygamma_kernel_impl.h

python/paddle/fluid/tests/unittests/test_polygamma_op.py

luotao1 · 2023-05-31T04:02:15Z

CI-Approval 有一条关于检测到使用了std::cout / print的报告，辛苦再确认下是否是误报呢。

是代码示例中用了print， @tianshuo78520a 在做规则增强，等该 PR 全部 ready 后，可以豁免

PommesPeter · 2023-05-31T06:53:41Z

CI-Approval 有一条关于检测到使用了std::cout / print的报告，辛苦再确认下是否是误报呢。

是代码示例中用了print， @tianshuo78520a 在做规则增强，等该 PR 全部 ready 后，可以豁免

好的

luotao1 · 2023-06-02T07:47:06Z

请修复下CI问题，可以merge develop

PommesPeter · 2023-06-02T07:51:42Z

请修复下CI问题，可以merge develop

好的，正在修复

luotao1 · 2023-06-03T05:27:19Z

单测覆盖率不够，请补充单测

PommesPeter · 2023-06-03T08:38:58Z

单测覆盖率不够，请补充单测

好的

zoooo0820

LGTM

python/paddle/tensor/math.py

Co-authored-by: zachary sun <70642955+sunzhongkai588@users.noreply.github.com>

sunzhongkai588

LGTM for docs

sunzhongkai588

LGTM for docs

jeff41404

LGTM

paddle-bot bot added contributor External developers status: proposed labels May 14, 2023

luotao1 assigned luotao1, zoooo0820 and Ligoml May 16, 2023

luotao1 added the API label May 16, 2023

paddle-bot bot removed the status: proposed label May 16, 2023

luotao1 added the PaddlePaddle Hackathon label May 17, 2023

PommesPeter mentioned this pull request May 22, 2023

【PaddlePaddle Hackathon 第四期】任务总览 #51281

Closed

zoooo0820 reviewed May 31, 2023

View reviewed changes

zoooo0820 previously approved these changes Jun 2, 2023

View reviewed changes

PommesPeter dismissed zoooo0820’s stale review via 41bc060 June 2, 2023 08:18

PommesPeter added 9 commits June 2, 2023 18:25

feat: added polygamma init code

f210c7a

feat: added polygamma unittest code

871c334

test: added more test cases

b27766b

refactor: added forward impl

9b638b9

refactor: added backward impl

f936cd3

test: updated cases

6777dbc

refactor: updated test cases

c75e318

refactor: added more case and fixed some bugs

9f1fb85

test: updated ref func

f9251c6

PommesPeter force-pushed the polygamma branch from 41bc060 to f9251c6 Compare June 2, 2023 10:28

refactor: updated code style

b770def

PommesPeter added 3 commits June 3, 2023 17:40

refactor: move the code

a443ded

refactor: updated test

df232a6

refactor: updated test

4627e71

zoooo0820 previously approved these changes Jun 5, 2023

View reviewed changes

luotao1 assigned sunzhongkai588 Jun 5, 2023

sunzhongkai588 reviewed Jun 5, 2023

View reviewed changes

python/paddle/tensor/math.py Outdated Show resolved Hide resolved

PommesPeter dismissed zoooo0820’s stale review via 2f45aa3 June 5, 2023 04:20

PommesPeter and others added 2 commits June 5, 2023 12:20

docs: updated en doc

2f45aa3

Co-authored-by: zachary sun <70642955+sunzhongkai588@users.noreply.github.com>

docs: updated math eq

5375245

sunzhongkai588 approved these changes Jun 5, 2023

View reviewed changes

jeff41404 approved these changes Jun 5, 2023

View reviewed changes

This comment was marked as duplicate.

Sign in to view

luotao1 merged commit ed60456 into PaddlePaddle:develop Jun 5, 2023
24 of 25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

【Hackathon 4 No.19】Add polygamma API to Paddle #53791

【Hackathon 4 No.19】Add polygamma API to Paddle #53791

PommesPeter commented May 14, 2023 •

edited

paddle-bot bot commented May 14, 2023

paddle-bot bot commented May 14, 2023

PommesPeter commented May 22, 2023 •

edited

zoooo0820 commented May 23, 2023

zoooo0820 commented May 23, 2023

PommesPeter commented May 23, 2023

PommesPeter commented May 23, 2023

zoooo0820 commented May 23, 2023

PommesPeter commented May 23, 2023 •

edited

PommesPeter commented May 24, 2023

zoooo0820 commented May 24, 2023

PommesPeter commented May 28, 2023

zoooo0820 commented May 29, 2023

PommesPeter commented May 29, 2023

zoooo0820 commented May 29, 2023

zoooo0820 left a comment

zoooo0820 May 31, 2023

zoooo0820 May 31, 2023

luotao1 commented May 31, 2023

PommesPeter commented May 31, 2023

luotao1 commented Jun 2, 2023 •

edited

PommesPeter commented Jun 2, 2023

luotao1 commented Jun 3, 2023

PommesPeter commented Jun 3, 2023

zoooo0820 left a comment

sunzhongkai588 left a comment

sunzhongkai588 left a comment

jeff41404 left a comment

This comment was marked as duplicate.

【Hackathon 4 No.19】Add polygamma API to Paddle #53791

【Hackathon 4 No.19】Add polygamma API to Paddle #53791

Conversation

PommesPeter commented May 14, 2023 • edited

PR types

PR changes

Description

paddle-bot bot commented May 14, 2023

paddle-bot bot commented May 14, 2023

PommesPeter commented May 22, 2023 • edited

zoooo0820 commented May 23, 2023

zoooo0820 commented May 23, 2023

PommesPeter commented May 23, 2023

PommesPeter commented May 23, 2023

zoooo0820 commented May 23, 2023

PommesPeter commented May 23, 2023 • edited

PommesPeter commented May 24, 2023

zoooo0820 commented May 24, 2023

PommesPeter commented May 28, 2023

zoooo0820 commented May 29, 2023

PommesPeter commented May 29, 2023

zoooo0820 commented May 29, 2023

zoooo0820 left a comment

Choose a reason for hiding this comment

zoooo0820 May 31, 2023

Choose a reason for hiding this comment

zoooo0820 May 31, 2023

Choose a reason for hiding this comment

luotao1 commented May 31, 2023

PommesPeter commented May 31, 2023

luotao1 commented Jun 2, 2023 • edited

PommesPeter commented Jun 2, 2023

luotao1 commented Jun 3, 2023

PommesPeter commented Jun 3, 2023

zoooo0820 left a comment

Choose a reason for hiding this comment

sunzhongkai588 left a comment

Choose a reason for hiding this comment

sunzhongkai588 left a comment

Choose a reason for hiding this comment

jeff41404 left a comment

Choose a reason for hiding this comment

This comment was marked as duplicate.

PommesPeter commented May 14, 2023 •

edited

PommesPeter commented May 22, 2023 •

edited

PommesPeter commented May 23, 2023 •

edited

luotao1 commented Jun 2, 2023 •

edited