Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does Hyena support BERT style LLM? #32

Open
sherdencooper opened this issue Jul 13, 2023 · 3 comments
Open

Does Hyena support BERT style LLM? #32

sherdencooper opened this issue Jul 13, 2023 · 3 comments

Comments

@sherdencooper
Copy link

Hi, thanks for this awesome work! I am wondering if this could be applied to Bert style model since the paper describe that hyena filter preserves causality in order to predict only depending on the past. I have read your HyenaDNA paper and am thinking about use Hyena in my project, which needs looking from both future and past. Thanks a lot in advance.

@DanFu09
Copy link
Contributor

DanFu09 commented Jul 14, 2023

We've been playing around with things a bit along these lines - will update this issue when we have something more concrete!

@ddofer
Copy link

ddofer commented Sep 7, 2023

+1 - For discriminative models, bidirectional is the SOTA for a given parameter/size count (vs causal models). A bidirectional (MLM, ELECTRA etc' pretraining + expected input) would be amazing. Especially for Hyena DNA! (I have ideas on this front)

@DanFu09
Copy link
Contributor

DanFu09 commented Sep 12, 2023

Check out Monarch Mixer for BERT-style models: https://github.com/HazyResearch/m2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants