
Very slow (exponential) parsing speed with a large depth of addto_layer #2797

Closed · xyz2357 opened this issue Jul 10, 2017 · 3 comments

xyz2357 commented Jul 10, 2017

When there are a lot of addto_layers in a network, parsing takes exponential time.
Example 1:

def whole_network(src_embedding):
    enc = src_embedding
    for i in range(depth):
        enc = addto_layer([enc, enc])

    pred = fc_layer(input=fc_layer(
                        input=enc,
                        size=dim_embedding,
                        act=ReluActivation()
                        ),
                    size=label_dict_len,
                    act=SoftmaxActivation())
    return pred

Example 2:

def whole_network(src_embedding):
    enc = src_embedding
    for i in range(depth):
        enc_res = fc_layer(input=enc, size=dim_embedding)
        enc_res = fc_layer(input=enc_res, size=dim_embedding)
        enc = addto_layer([enc, enc_res])

    pred = fc_layer(input=fc_layer(
                        input=enc,
                        size=dim_embedding,
                        act=ReluActivation()
                        ),
                    size=label_dict_len,
                    act=SoftmaxActivation())
    return pred

Both cost a huge amount of time to parse (tested with the nest_diagram tool).
My parsing times:

depth: 4,  parsing time: 0.16
depth: 8,  parsing time: 0.16
depth: 12, parsing time: 0.33
depth: 16, parsing time: 2.02
depth: 20, parsing time: 32.08
depth: 21, parsing time: 67.05
depth: 22, parsing time: 131.48
depth: 23, parsing time: 268.82
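
For intuition, here is a minimal standalone sketch (plain Python, not PaddlePaddle code, and only an assumed model of the topology) of why the time blows up: each addto_layer lists the previous layer twice as an input, so the topology is a small DAG, but a depth-first traversal that does not remember visited nodes walks every root-to-leaf path, and the number of paths doubles with each extra level of depth.

# Standalone illustration (assumed model, not PaddlePaddle code):
# node i has inputs [i - 1, i - 1], mirroring addto_layer([enc, enc]) in Example 1.
def count_naive_visits(depth):
    def dfs(i):
        if i == 0:
            return 1
        # an unmemoized depth-first search descends into both inputs separately
        return 1 + dfs(i - 1) + dfs(i - 1)
    return dfs(depth)

for d in (4, 8, 12, 16, 20):
    print(d, count_naive_visits(d))  # node visits grow roughly as 2**depth

The visit counts double with each extra level, which matches the roughly 2x jump in parsing time per extra level reported above.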
wangkuiyi (Collaborator) commented
Just to note: "parsing time" here means the time used for parsing and generating the protobuf.

reyoung added the Bug label Jul 11, 2017
reyoung added a commit to reyoung/Paddle that referenced this issue Jul 11, 2017
* Fix PaddlePaddle#2797
* This happens because trainer_config_helpers' __dfs_travel__ did not record
  which nodes had already been visited, so if the topology has recursive
  dependencies, some nodes are traversed multiple times.
* Add a `travelled` set to record which nodes have been visited.
* Also add a unittest for this situation.
reyoung (Collaborator) commented Jul 11, 2017

This issue is fixed by #2802. It happens because, when we parse the network topology via outputs(xxx), we just run a depth-first search without any memoization.

In #2802, if a node has already been visited, it is simply skipped. That fixes this issue.

You can just apply this patch to network.py in the paddle python package.
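
The shape of that change, as a hedged sketch (this is not the actual #2802 patch; layer.name and layer.parents are placeholder attributes used only for illustration): the traversal keeps a set of names it has already expanded and returns early when it sees one again, so each node is emitted exactly once.

# Sketch only: a depth-first traversal that skips already-visited nodes.
# `layer.name` and `layer.parents` are hypothetical attributes for this example.
def dfs_travel(layer, travelled, visit):
    if layer.name in travelled:
        return                      # already expanded once; skip the whole subtree
    travelled.add(layer.name)
    for parent in layer.parents:    # expand dependencies first
        dfs_travel(parent, travelled, visit)
    visit(layer)                    # emit this layer after all of its inputs

With such a set in place, each layer in Example 1's chain is visited once, so parsing becomes linear in the number of layers instead of exponential in depth.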

xyz2357 (Author) commented Jul 11, 2017

Thanks a lot!
