Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about unidirectional #9

Closed
bojone opened this issue May 21, 2019 · 3 comments
Closed

question about unidirectional #9

bojone opened this issue May 21, 2019 · 3 comments

Comments

@bojone
Copy link

bojone commented May 21, 2019

In your paper, you use a unidirectional ON-LSTM to trained a language model and then phrase grammar with the output distance of the pretrained language model. How can we explain that the level of first token is independent with the future tokens? Is there any bidirectional way to do it?

@yikangshen
Copy link
Owner

I think you can try bidirection language model (e.g. elmo) or masked language model (e.g. bert). But the perplexity won't be comparable to previous language models.

@bojone
Copy link
Author

bojone commented May 22, 2019

I am confused of how can we calculate distance if we use bi-on-lstm? the average of distances of forward and backward on-lstm ?

@bojone bojone closed this as completed May 28, 2019
@speedcell4
Copy link

@bojone I get confused on the same question, did you figure it out?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants