
add compile vs runtime discussion #3728

Closed

Conversation

@jacquesqiao (Member) commented Aug 28, 2017:

A pull request is better for review than an issue. The discussion process can also serve as a record.

1. InferShape only needs to be implemented once; the same function is called at both configuration (compile) time and run time (see the sketch at the end of this thread).

#### Disadvantages
1. Implementing Clone may not be simple, e.g., how to synchronize memory across device types (Scope for CPU vs Scope for GPU).
Contributor commented:

We could add the issue of InferShape inside RNNOp, where each time step has a variable length; also, the output of cond_op only has its concrete shape known after Run.

Member Author (@jacquesqiao) replied:

👌
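To make the quoted point concrete (that InferShape only needs to be implemented once), here is a minimal C++ sketch. InferShapeContext, CompileTimeContext, RunTimeContext, and CopyOpInferShape are toy stand-ins rather than Paddle's actual classes: one InferShape body is written against a shared context interface and is called once over VarDesc-like metadata at compile time and again over runtime shapes.

```cpp
// Toy sketch: one InferShape implementation, two calling contexts.
// These types are illustrative stand-ins, not Paddle's actual API.
#include <cassert>
#include <map>
#include <string>
#include <vector>

using Shape = std::vector<int>;

// Shared interface that both the compile-time and runtime contexts implement.
struct InferShapeContext {
  virtual ~InferShapeContext() = default;
  virtual Shape GetInputShape(const std::string& name) const = 0;
  virtual void SetOutputShape(const std::string& name, const Shape& shape) = 0;
};

// The single InferShape for a toy "copy" op: output shape equals input shape.
void CopyOpInferShape(InferShapeContext* ctx) {
  ctx->SetOutputShape("Out", ctx->GetInputShape("X"));
}

// Compile-time context: backed by a shape map standing in for VarDesc.
struct CompileTimeContext : InferShapeContext {
  std::map<std::string, Shape>* var_descs = nullptr;
  Shape GetInputShape(const std::string& n) const override { return var_descs->at(n); }
  void SetOutputShape(const std::string& n, const Shape& s) override { (*var_descs)[n] = s; }
};

// Runtime context: backed by another map standing in for Scope + Tensor dims.
struct RunTimeContext : InferShapeContext {
  std::map<std::string, Shape>* tensors = nullptr;
  Shape GetInputShape(const std::string& n) const override { return tensors->at(n); }
  void SetOutputShape(const std::string& n, const Shape& s) override { (*tensors)[n] = s; }
};

int main() {
  std::map<std::string, Shape> var_descs{{"X", {32, 128}}};
  CompileTimeContext compile_ctx;
  compile_ctx.var_descs = &var_descs;
  CopyOpInferShape(&compile_ctx);            // called once at compile time
  assert(var_descs.at("Out") == Shape({32, 128}));

  std::map<std::string, Shape> tensors{{"X", {8, 128}}};  // real batch differs
  RunTimeContext run_ctx;
  run_ctx.tensors = &tensors;
  CopyOpInferShape(&run_ctx);                // the very same function at run time
  assert(tensors.at("Out") == Shape({8, 128}));
  return 0;
}
```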

1. Switching Scope is simple: just pass a new Scope when the Op runs, and the framework creates the corresponding Vars in it from the global VarDesc map before running (see the sketch below).
2. Storing metadata in VarDesc makes graph optimization easier.
3. InferShape no longer needs a Scope parameter, because all modified VarDescs live in the global map.
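A minimal sketch of point 1 above, with toy stand-ins for Scope, Variable, and VarDesc rather than Paddle's real classes: before an Op runs in a fresh Scope, the framework creates the variables it needs from a global name-to-VarDesc map, so switching Scopes requires nothing else.

```cpp
// Toy sketch: create an Op's variables in any Scope from a global VarDesc map.
// Scope, Variable, and VarDesc here are illustrative, not Paddle's real types.
#include <iostream>
#include <map>
#include <memory>
#include <string>
#include <vector>

struct VarDesc { std::vector<int> shape; };   // compile-time metadata
struct Variable { std::vector<int> shape; };  // runtime object living in a Scope

struct Scope {
  std::map<std::string, std::unique_ptr<Variable>> vars;
  Variable* Var(const std::string& name) {
    auto& v = vars[name];
    if (!v) v.reset(new Variable());
    return v.get();
  }
};

// Global description of every variable in the program.
std::map<std::string, VarDesc> g_var_descs = {{"X", {{32, 128}}}, {"Out", {{32, 10}}}};

// Before running an Op in a (possibly brand-new) Scope, create the variables
// it needs from the global descriptions; nothing else is required to switch Scopes.
void PrepareScope(Scope* scope, const std::vector<std::string>& names) {
  for (const auto& name : names) scope->Var(name)->shape = g_var_descs.at(name).shape;
}

int main() {
  Scope step0, step1;                          // e.g. two RNN time steps
  PrepareScope(&step0, {"X", "Out"});
  PrepareScope(&step1, {"X", "Out"});          // the same Op can now Run in either Scope
  std::cout << step0.Var("Out")->shape[1] << "\n";   // prints 10
  return 0;
}
```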

Contributor commented:

Adding a point from this afternoon's discussion: VarDesc makes it possible to serialize the model (description), which has the following benefits:
1. It ensures data security on Paddle Cloud: a user-submitted script can only run with the corresponding serialized model, and this can be checked on the server side.
2. For business lines with strict source-code security requirements, the deployed model is isolated from the training source code.

1. Implementing InferShape is complex: at compile time InferShape is based on VarDesc, but at run time InferShape and resize() are still needed, because
   a. the size may be modified by the user at run time;
   b. some Op implementations also require InferShape at run time (e.g., RNN).

Contributor commented:

As Yu Yang (于洋) mentioned, one check becomes three, so the complexity goes up.

Also, when checking in set_size, do we allow users to change sizes other than batch_size?

@QiJune (Member) commented Aug 29, 2017:

Please consider this by the way: #3717 (comment)

@QiJune (Member) commented Aug 29, 2017:

Please consider the design of the add operator at https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/framework/backward.cc#L145. This add actually sums a variable number of input Tensors to produce one Tensor.

The actual number of inputs is decided at run time by the user's data; at compile time, how should we design the corresponding AddOpProto and AddOperator so that they can accept runtime parameters?
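One possible shape of an answer, sketched with toy types (Tensor and AddOperator here are illustrative, and the "repeatable input slot" design is an assumption, not Paddle's confirmed proto): the proto declares a single repeatable input, and the operator sums however many tensors are actually bound to that slot at run time.

```cpp
// Toy sketch: an "add" operator whose input count is only known at run time.
// Tensor and AddOperator are illustrative stand-ins, not Paddle's real classes.
#include <cassert>
#include <cstddef>
#include <vector>

using Tensor = std::vector<float>;

struct AddOperator {
  // At compile time the proto only says "X is a repeatable input"; the
  // concrete number of tensors arrives with the user's data at run time.
  Tensor Run(const std::vector<const Tensor*>& inputs) const {
    assert(!inputs.empty());
    Tensor out(inputs[0]->size(), 0.0f);
    for (const Tensor* in : inputs) {
      assert(in->size() == out.size());        // runtime size check
      for (std::size_t i = 0; i < out.size(); ++i) out[i] += (*in)[i];
    }
    return out;
  }
};

int main() {
  Tensor a{1, 2, 3}, b{10, 20, 30}, c{100, 200, 300};
  AddOperator add;
  Tensor out = add.Run({&a, &b, &c});          // three inputs this time; could be any number
  assert(out[2] == 333.0f);
  return 0;
}
```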

- After compilation, if the user manually modifies the dimensions of some parameters and we do not check, the error cannot be detected.
- Some dimensions can only be determined after seeing the real data, e.g., for condition ops (if-else, while): the data size after branch selection can only be determined once the real data is known.

**Conclusion**: InferShape needs to be called at both compile time and run time. At compile time it mainly checks whether the configured sizes are correct; at run time it must both check the sizes and do some resizing based on the real data.
Contributor commented:

InferShape and check-size are two different issues; infer and check can be explained separately.

Right now it feels like sometimes we are talking about infer and sometimes about check.

Member Author (@jacquesqiao) replied:

OK, then let's treat check-size and infer-size/resize as two separate things and explain them separately (a sketch of the split follows below).
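A minimal sketch of that split, using toy types and assumed function names rather than Paddle's actual API: a compile-time pass that only checks the configured sizes, and a runtime pass that checks the real data against the description and then produces the resized output shape.

```cpp
// Toy sketch: compile-time size check vs. runtime check + resize.
// Shapes and function names here are illustrative assumptions.
#include <cassert>
#include <stdexcept>
#include <vector>

using Shape = std::vector<int>;   // -1 marks a dimension unknown until run time

// Compile time: only verify that the configured sizes are consistent,
// e.g. a [batch, K] x [K, N] matmul requires the two K's to match.
void CompileTimeCheck(const Shape& x_desc, const Shape& w_desc) {
  if (x_desc[1] != w_desc[0]) throw std::runtime_error("configured sizes do not match");
}

// Run time: check the real input against the description, then compute the
// real output shape (the resize) from the actual batch size.
Shape RunTimeInferShape(const Shape& x_real, const Shape& x_desc, const Shape& w_desc) {
  if (x_real[1] != x_desc[1]) throw std::runtime_error("fixed dimension was changed");
  return {x_real[0], w_desc[1]};               // output resized to the real batch
}

int main() {
  Shape x_desc{-1, 128}, w_desc{128, 10};
  CompileTimeCheck(x_desc, w_desc);            // K checked; batch still unknown (-1)
  Shape out = RunTimeInferShape({64, 128}, x_desc, w_desc);
  assert(out == Shape({64, 10}));
  return 0;
}
```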

4. In the distributed setting, the graph needs to be serialized and sent to other machines for execution, which ultimately requires serializing the Variables' attributes as well. A benefit of this is that jobs executed in the cloud are controllable: what the user sends is a serialized graph rather than script source code, which helps data security.

#### Disadvantages
1. Implementing InferShape is complex: at compile time InferShape is based on VarDesc, but at run time InferShape and resize() are still needed, because
Contributor commented:

For a two-level variable-length RNN, the shape of the outer RNN's output is hard to determine before Run finishes, so a natural idea is to set the output's shape after Run.
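A minimal sketch of that idea with toy types (not the real RNNOp interface): Run first produces whatever amount of data the variable-length loop yields, and only afterwards sets the output's shape.

```cpp
// Toy sketch: the output shape of a variable-length loop is set after Run.
// Tensor and RunOuterRnn are illustrative stand-ins, not Paddle's real code.
#include <cassert>
#include <utility>
#include <vector>

struct Tensor {
  std::vector<float> data;
  std::vector<int> shape;                       // still unset before Run
  void Resize(std::vector<int> s) { shape = std::move(s); }
};

// Each sequence contributes a different number of steps, so the total output
// length is only known once the loop has finished.
void RunOuterRnn(const std::vector<int>& steps_per_seq, Tensor* out) {
  for (int steps : steps_per_seq)
    for (int t = 0; t < steps; ++t) out->data.push_back(0.0f);  // fake step output
  out->Resize({static_cast<int>(out->data.size()), 1});         // shape set after Run
}

int main() {
  Tensor out;
  RunOuterRnn({3, 5, 2}, &out);
  assert(out.shape[0] == 10);
  return 0;
}
```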
