bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists)，作用是啥。 #37

tianke0711 · 2021-06-06T08:43:41Z

bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists)，作用是啥。在训练中有啥好处。如果不排序是否也没事呢。


def sort_by_lengths(word_lists, tag_lists):
    pairs = list(zip(word_lists, tag_lists))
    indices = sorted(range(len(pairs)),
                     key=lambda k: len(pairs[k][0]),
                     reverse=True)
    pairs = [pairs[i] for i in indices]
    # pairs.sort(key=lambda pair: len(pair[0]), reverse=True)

    word_lists, tag_lists = list(zip(*pairs))

    return word_lists, tag_lists, indices

The text was updated successfully, but these errors were encountered:

qukequke · 2021-07-21T03:28:15Z

前面没对batch中的句子做补零，在取到batch之后，用batch中的第一个句子的长度作为最大长度对batch内的剩余句子进行补零，所以排序是有用的

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists)，作用是啥。 #37

bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists)，作用是啥。 #37

tianke0711 commented Jun 6, 2021 •

edited

Loading

qukequke commented Jul 21, 2021

bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists)，作用是啥。 #37

bilistm_crf模型中的为啥要sort_by_lengths(word_lists, tag_lists)，作用是啥。 #37

Comments

tianke0711 commented Jun 6, 2021 • edited Loading

qukequke commented Jul 21, 2021

tianke0711 commented Jun 6, 2021 •

edited

Loading