
sequence length independent generation #45

Open
tmphex opened this issue May 25, 2021 · 7 comments

Comments

@tmphex
Contributor

tmphex commented May 25, 2021

Currently, generation requires passing a sequence length to produce sequences of a given length, but in tasks such as summarization or translation one doesn't know the final sequence length in advance. As a workaround, I am currently generating candidates by passing various lengths. Also, is it possible to add support for beam search during generation, in addition to the current top_p/top_k methods?

@lucidrains
Owner

@tmphex i believe there's already support for detecting end of string tokens https://github.com/lucidrains/x-transformers/blob/main/x_transformers/autoregressive_wrapper.py#L45
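For readers following along, the idea behind that eos_token check can be sketched like this (a minimal pure-Python illustration, not the actual x-transformers code; the toy `toy_next_token` model and the token ids are hypothetical):

```python
# Minimal sketch of EOS-based early stopping during autoregressive
# decoding. This is NOT the x-transformers implementation -- just an
# illustration of the idea behind the eos_token check linked above.

EOS = 0  # hypothetical end-of-string token id

def toy_next_token(seq):
    """Stand-in for a real model: emits increasing ids, then EOS."""
    return EOS if len(seq) >= 5 else len(seq) + 1

def generate(start_tokens, max_seq_len, eos_token=None):
    out = list(start_tokens)
    for _ in range(max_seq_len):
        token = toy_next_token(out)
        out.append(token)
        # stop as soon as the end-of-string token is produced,
        # instead of always decoding max_seq_len tokens
        if eos_token is not None and token == eos_token:
            break
    return out

print(generate([1], max_seq_len=50, eos_token=EOS))  # -> [1, 2, 3, 4, 5, 0]
```

With an eos_token set, max_seq_len becomes an upper bound rather than the exact output length, which is what makes summarization/translation-style generation length-independent.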

@lucidrains
Owner

as for beam search, let me think about it - i saw a paper out there with a fast, optimized beam search, and maybe it's worth creating a separate repo for that
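For anyone unfamiliar with the technique being requested, a minimal beam search over a toy per-step scorer looks like this (purely illustrative; the fixed per-step distributions and beam width are hypothetical stand-ins for a real model's conditional log-probs):

```python
import math

def beam_search(step_log_probs, beam_width=2):
    """step_log_probs: list of dicts mapping token -> log-prob at each step.
    Keeps the beam_width highest-scoring partial sequences at every step."""
    beams = [([], 0.0)]  # (tokens, cumulative log-prob)
    for dist in step_log_probs:
        candidates = []
        for tokens, score in beams:
            for token, lp in dist.items():
                candidates.append((tokens + [token], score + lp))
        # prune: keep only the top beam_width hypotheses by total score
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams

steps = [
    {"a": math.log(0.6), "b": math.log(0.4)},
    {"a": math.log(0.1), "b": math.log(0.9)},
]
best_tokens, best_score = beam_search(steps, beam_width=2)[0]
print(best_tokens)  # ['a', 'b'] -- joint probability 0.6 * 0.9 = 0.54
```

Note the toy distributions here don't depend on the prefix; in a real model each step's scores would come from re-running the decoder on every beam, which is exactly the cost the optimized implementations try to reduce.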

@tmphex
Contributor Author

tmphex commented May 26, 2021

Thanks @lucidrains for pointing out the eos_token. I have created a PR so that arguments can be passed directly to the model.generate function, rather than needing to call model.decoder.generate.

I look forward to trying out the optimized beam search once it's available.

@lucidrains
Owner

@tmphex ohh yes, you indeed found a bug, thank you! one other thing is that the eos_token may not work when the batch size is greater than 1, but i'll get that fixed this week (we can keep this issue open)
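One common fix for the batched case is to track a per-sequence "finished" mask and keep emitting padding for finished rows until every sequence has produced EOS (a hedged sketch of the idea only, not the eventual x-transformers fix; the pad/EOS ids and the toy per-row model are hypothetical):

```python
EOS, PAD = 0, -1  # hypothetical token ids

def toy_next_token(row_id, step):
    """Stand-in model: row i emits EOS after i+1 real tokens."""
    return EOS if step >= row_id + 1 else step + 1

def generate_batched(batch_size, max_seq_len):
    sequences = [[] for _ in range(batch_size)]
    finished = [False] * batch_size
    for step in range(max_seq_len):
        for i in range(batch_size):
            if finished[i]:
                # this row already ended: emit padding, not real tokens
                sequences[i].append(PAD)
                continue
            token = toy_next_token(i, step)
            sequences[i].append(token)
            if token == EOS:
                finished[i] = True
        if all(finished):
            break  # every row has hit EOS -- stop decoding early
    return sequences

print(generate_batched(batch_size=2, max_seq_len=10))
# -> [[1, 0, -1], [1, 2, 0]]
```

The subtlety is exactly the one noted above: with batch size > 1, different rows finish at different steps, so a single "did we see EOS" check can't stop the whole batch without masking.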

@tmphex
Contributor Author

tmphex commented Jun 2, 2021

@lucidrains just pinging to ask if you've gotten around to fixing the eos_token and adding beam search support 🙏

@tmphex
Contributor Author

tmphex commented Jun 10, 2021

The newly released FastSeq (https://github.com/microsoft/fastseq) might be interesting to integrate with when you work on optimizing generation and adding beam search support.

@tmphex
Contributor Author

tmphex commented Aug 12, 2021

@lucidrains it seems the eos_token issue has been fixed. If you plan to implement beam search, we can keep this issue open; otherwise, feel free to close it.
