Design: infer_var_type #4795

reyoung · 2017-10-13T19:03:10Z

No description provided.

QiJune · 2017-10-13T19:13:52Z

Maybe we should also register InferShape to OpInfo @jacquesqiao

Superjomn · 2017-10-13T19:14:34Z

doc/design/infer_var_type.md

+
+The variable in our design can hold variant types. Such as `LoDTensor` and `SelectedRows`. An operator should be able to inference the variable types of its output.
+
+For example, a `lookup table` operator takes two `LoDTensor`; one is a float tensor as the embedding table, the other is an int tensor as word ID. The gradient operator of `lookup table` will generate a `SelectedRows` as its output. A `sum` operator can take both `LoDTensor` and `SelectedRows` as its inputs and will generate a `LoDTensor` if any of its inputs is `LoDTensor`, otherwise, the `sum` operator will generate `SelectedRows` as its output.
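
The rule for `sum` quoted above can be sketched in a few lines. This is only an illustration of the stated rule, not the code in the PR: `VarType` and `infer_sum_output_type` are hypothetical names.

```python
from enum import Enum


class VarType(Enum):
    # Hypothetical enum; only the two variable types named in the design doc.
    LOD_TENSOR = "LoDTensor"
    SELECTED_ROWS = "SelectedRows"


def infer_sum_output_type(input_types):
    """Return LoDTensor if any input is a LoDTensor, otherwise SelectedRows."""
    if not input_types:
        raise ValueError("sum needs at least one input")
    if any(t is VarType.LOD_TENSOR for t in input_types):
        return VarType.LOD_TENSOR
    return VarType.SELECTED_ROWS
```

Under this sketch, a mix of `SelectedRows` and `LoDTensor` inputs yields a `LoDTensor`, and only an all-`SelectedRows` input list keeps the output as `SelectedRows`.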


In my understanding, SelectedRows consists of a LoDTensor that stores the word embeddings, a LoD that stores the sequence info, and the selected row ids (perhaps a vector<int>).

LoDTensor is already supported by other ops.

In my opinion, the selected row ids are only used by lookup and do not need to be exposed to other ops such as sum, so there is no need for a new data type, besides LoDTensor, that ops have to handle.

After a discussion with reyoung: the new type SelectedRows serves scenarios such as summing the outputs of two lookups.

That seems OK, and there is currently no better way to support this.
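
To make the "sum of two lookups" scenario concrete, here is a rough sketch of how two sparse row sets might be summed without building a dense tensor. The thread does not describe the real layout of SelectedRows, so everything here is an assumption: the class, its fields `height`, `rows` and `values`, and the helper functions are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SelectedRows:
    # Hypothetical layout, assumed for illustration only.
    height: int                # number of rows in the full (dense) table
    rows: List[int]            # ids of the rows that are actually stored
    values: List[List[float]]  # one value row per entry in `rows`


def sum_selected_rows(a, b):
    """Sum two sparse row sets by concatenation; duplicate ids are kept."""
    if a.height != b.height:
        raise ValueError("both inputs must describe tables of the same height")
    return SelectedRows(a.height, a.rows + b.rows, a.values + b.values)


def to_dense(sr, width):
    """Expand to a dense table, accumulating rows that share an id."""
    dense = [[0.0] * width for _ in range(sr.height)]
    for row_id, value in zip(sr.rows, sr.values):
        for j, x in enumerate(value):
            dense[row_id][j] += x
    return dense
```

In this sketch the sum stays sparse, and rows with the same id are only merged when a dense view is requested. That is one possible design, chosen here to keep the example short.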

wangkuiyi

InferVarType and InferShape are actually the same thing -- it should be called "Infer Type"

reyoung · 2017-10-16T17:18:27Z

> InferVarType and InferShape are actually the same thing -- it should be called "Infer Type"

@wangkuiyi
However, I think there are several differences between InferVarType and InferShape:

1. InferVarType MUST be invoked before InferShape, because our protobuf design requires a variable's type to be set before its shape.
2. InferVarType is invoked ONLY at compile time, since a variable's type does not change at runtime. InferShape, however, is invoked at both compile time and runtime.
3. Most operators do not need to implement InferVarType, since they simply generate LoDTensor as their output.
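
The ordering and the default behaviour described above could look roughly like the following compile-time pass. This is a sketch under assumptions, not the framework's API: the dict-based op description, the `VAR_TYPE_INFERERS` registry and all function names are hypothetical.

```python
LOD_TENSOR = "LoDTensor"
SELECTED_ROWS = "SelectedRows"


def default_infer_var_type(op, var_types):
    # Default for most operators: every output is a LoDTensor.
    for name in op["outputs"]:
        var_types[name] = LOD_TENSOR


def lookup_table_grad_infer_var_type(op, var_types):
    # The gradient of lookup table produces SelectedRows.
    for name in op["outputs"]:
        var_types[name] = SELECTED_ROWS


# Hypothetical registry: only operators that need a custom rule appear here.
VAR_TYPE_INFERERS = {"lookup_table_grad": lookup_table_grad_infer_var_type}


def infer_shape(op, var_types, var_shapes):
    # A shape can only be set on a variable whose type is already known.
    for name in op["outputs"]:
        if name not in var_types:
            raise RuntimeError(f"type of {name!r} must be set before its shape")
        var_shapes[name] = op["out_shape"]


def compile_block(ops):
    """Compile-time pass: infer the variable type first, then the shape."""
    var_types, var_shapes = {}, {}
    for op in ops:
        inferer = VAR_TYPE_INFERERS.get(op["type"], default_infer_var_type)
        inferer(op, var_types)
        infer_shape(op, var_types, var_shapes)
    return var_types, var_shapes
```

Only the compile-time path is shown. A runtime pass, in this sketch, would call `infer_shape` again with the types already fixed.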

QiJune

LGTM

reyoung requested review from jacquesqiao, JiayiFeng and QiJune October 13, 2017 19:03

Design Doc: infer_var_type

f654810

Superjomn requested a review from qingqing01 October 13, 2017 19:05

Superjomn reviewed Oct 13, 2017

reyoung changed the title ~~Design Doc: infer_var_type~~ Design & implementation: infer_var_type Oct 13, 2017

reyoung changed the title ~~Design & implementation: infer_var_type~~ Design: infer_var_type Oct 13, 2017

reyoung mentioned this pull request Oct 13, 2017

Complete infer_var_type #4797

Merged

reyoung force-pushed the feature/var_type_inferer branch from 7ed3ea6 to f654810 Compare October 13, 2017 21:07

wangkuiyi mentioned this pull request Oct 16, 2017

Implement infer_var_type #4820

Closed

wangkuiyi reviewed Oct 16, 2017

QiJune approved these changes Oct 16, 2017

reyoung merged commit f43b1a9 into PaddlePaddle:develop Oct 17, 2017

QiJune added this to Done in PaddlePaddle Refactoring: Phase 3 Oct 18, 2017

reyoung deleted the feature/var_type_inferer branch October 28, 2017 22:14
