fix dtype not matching bug in log_prob and probs method of Distribution class #26767

pangyoki · 2020-08-28T04:10:30Z

PR types

Bug fixes

PR changes

APIs

Describe

In _to_tensor method of Distribution class (refer to PR #26355 and PR #26535). Even if we want to support both float32 and float64 dtype in Distribution classes, when parameters (low and high in Uniform, loc and scale in Normal) are numpy.ndarray and dtypes are float64, we can only set dtype to be float32 using assign op to get the correspoding variable. Becase assign op doesn't support float64 when input is numpy.ndarray.

Thus, when parameters are `numpy.ndarray` with `float64` dtype, 
we will transfer them into `VarType.FP32` variable.

In log_prob and probs methods in Distribution class, the input value of these methods is a tensor. In users' view, it's reasonable that the dtype of value and parameters are same.

When dtype of parameters are `float64` in `numpy.ndarray` and dtype of `value` is `VarType.FP64`, 
it will cause error because the dtype of parameters change to `VarType.FP32`.

The following is an example code:

import numpy as np
import paddle
from paddle.distribution import Normal

paddle.disable_static()

value_np = np.array([0.8, 0.3], dtype='float64')
value_tensor = paddle.to_tensor(value_np)  # 'float64' Tensor

loc_np = np.array([1, 2]).astype('float64')  # will be converted to 'float32' Tensor automatically
scale_np = np.array([11, 22]).astype('float64')    # will be converted to 'float32' Tensor automatically
normal = Normal(loc_np, scale_np)

lp = normal.log_prob(value_tensor)  # error !!!

We are going to let assign op support float64, but it will lose precision because Attr don't support float64 in framework.proto (refer to #26797). That is, assign op can only support float32.

Thus, in this PR, we use cast operation to convert dtype after assign op if dtype is float64.
If users define a Uniform distribution whose low and high are float64 numpy.ndarray, we firstly use assign op to get float32 variable. Then use cast to get float64 variable.
What's more, probs and log_prob methods have a variable input named values. If dtype of values is different with low in Uniform or loc in Normal, it will cause error.
To solve this dtype conflict, we cast dtype of values to be the same as that of low or loc. (in _check_values_dtype_in_probs function)
In Doc discribtion, we add formula for entropy and kl-divergence methods. Formula for log_prob and probs have been given in doc of class, that is, the pdf (probability density function) of the distribution.
By the way, we rewrite unittest to make it more readable.

paddle-bot-old · 2020-08-28T04:11:26Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

zhiqiu · 2020-09-03T04:24:39Z

python/paddle/distribution.py

+                raise TypeError(
+                    "Type of input args must be float, list, numpy.ndarray or Tensor, but received type {}".
+                    format(type(arg)))
+
            arg_np = np.array(arg)
            arg_dtype = arg_np.dtype
            if str(arg_dtype) not in ['float32']:


Suggested change

if str(arg_dtype) not in ['float32']:

if str(arg_dtype) != 'float32':

And, the positive conditions is better than the negative conditions.

zhiqiu · 2020-09-03T04:25:11Z

python/paddle/distribution.py

-                    "data type of argument only support float32, your argument will be convert to float32."
-                )
+                # "assign" op doesn't support float64. if dtype is float64, float32 variable will be generated and transformed to float64 later using "cast".
+                if str(arg_dtype) not in ['float64']:


Suggested change

if str(arg_dtype) not in ['float64']:

if str(arg_dtype) != 'float64':

zhiqiu · 2020-09-03T04:31:00Z

python/paddle/distribution.py

            if isinstance(arg, float):
-                arg = np.zeros(1) + arg
+                arg = [arg]
+            elif not isinstance(arg, list) and not isinstance(arg, np.ndarray):


Suggested change

elif not isinstance(arg, list) and not isinstance(arg, np.ndarray):

elif not isinstance(arg, (list, np.ndarray)):

zhiqiu

LGTM

jzhang533

lgtm

…on class (PaddlePaddle#26767) * fix _to_tensor method of Distribution class * Add unittest * let dtype be consistent with value in log_prob and probs * fix format * fix dtype problem and change unittest * fix dtype of Numpy class in unittest * add formula for entropy and kl * change formula * fix kl formula format * fix kl formula format 2 * change gt to np in unittest * optimize unittest format * delete dumplicate * delete dumplicate 2 * extract common function used to convert dtype value

* fix dtype not matching bug in log_prob and probs method of Distribution class (#26767) * fix _to_tensor method of Distribution class * Add unittest * let dtype be consistent with value in log_prob and probs * fix format * fix dtype problem and change unittest * fix dtype of Numpy class in unittest * add formula for entropy and kl * change formula * fix kl formula format * fix kl formula format 2 * change gt to np in unittest * optimize unittest format * delete dumplicate * delete dumplicate 2 * extract common function used to convert dtype value * cherry pick 27046

pangyoki force-pushed the fix-_to_tensor-dtype-error-branch branch 6 times, most recently from 18029cd to 4da51a3 Compare September 2, 2020 19:32

zhiqiu reviewed Sep 3, 2020

View reviewed changes

pangyoki added 15 commits September 3, 2020 16:29

fix _to_tensor method of Distribution class

7fa5ecf

Add unittest

3bd34a0

let dtype be consistent with value in log_prob and probs

ef5fd84

fix format

a1a30f4

fix dtype problem and change unittest

472d057

fix dtype of Numpy class in unittest

2549fd8

add formula for entropy and kl

85b4118

change formula

f1ed2b8

fix kl formula format

9cbc479

fix kl formula format 2

8669d05

change gt to np in unittest

78a965d

optimize unittest format

8e63935

delete dumplicate

e88164a

delete dumplicate 2

5e7de82

extract common function used to convert dtype value

d3312c6

pangyoki force-pushed the fix-_to_tensor-dtype-error-branch branch from 30827ca to d3312c6 Compare September 3, 2020 16:29

pangyoki changed the title ~~fix _to_tensor method of Distribution class~~ fix dtype not matching bug in log_prob and probs method of Distribution class Sep 4, 2020

zhiqiu approved these changes Sep 4, 2020

View reviewed changes

jzhang533 approved these changes Sep 4, 2020

View reviewed changes

zhiqiu merged commit a0c98e6 into PaddlePaddle:develop Sep 4, 2020

pangyoki mentioned this pull request Sep 4, 2020

fix _check_values_dtype_in_probs method in Distribution class #27046

Merged

pangyoki mentioned this pull request Sep 7, 2020

Cherry pick 26767 #27102

Merged

pangyoki mentioned this pull request Sep 24, 2020

fix the precision problem of test_distribution #27524

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix dtype not matching bug in log_prob and probs method of Distribution class #26767

fix dtype not matching bug in log_prob and probs method of Distribution class #26767

pangyoki commented Aug 28, 2020 •

edited

paddle-bot-old bot commented Aug 28, 2020

zhiqiu Sep 3, 2020

zhiqiu Sep 3, 2020

pangyoki Sep 4, 2020

zhiqiu Sep 3, 2020

pangyoki Sep 4, 2020

zhiqiu Sep 3, 2020

pangyoki Sep 4, 2020

zhiqiu left a comment

jzhang533 left a comment

	if str(arg_dtype) not in ['float32']:
	if str(arg_dtype) != 'float32':

	if str(arg_dtype) not in ['float64']:
	if str(arg_dtype) != 'float64':

	elif not isinstance(arg, list) and not isinstance(arg, np.ndarray):
	elif not isinstance(arg, (list, np.ndarray)):

fix dtype not matching bug in log_prob and probs method of Distribution class #26767

fix dtype not matching bug in log_prob and probs method of Distribution class #26767

Conversation

pangyoki commented Aug 28, 2020 • edited

PR types

PR changes

Describe

paddle-bot-old bot commented Aug 28, 2020

zhiqiu Sep 3, 2020

Choose a reason for hiding this comment

zhiqiu Sep 3, 2020

Choose a reason for hiding this comment

pangyoki Sep 4, 2020

Choose a reason for hiding this comment

zhiqiu Sep 3, 2020

Choose a reason for hiding this comment

pangyoki Sep 4, 2020

Choose a reason for hiding this comment

zhiqiu Sep 3, 2020

Choose a reason for hiding this comment

pangyoki Sep 4, 2020

Choose a reason for hiding this comment

zhiqiu left a comment

Choose a reason for hiding this comment

jzhang533 left a comment

Choose a reason for hiding this comment

pangyoki commented Aug 28, 2020 •

edited