Construct SGPT #11

Closed
tsamarahrana opened this issue Oct 28, 2022 · 4 comments

Comments

@tsamarahrana

Hello, I am currently working on my project and I'm interested in your paper.

First of all, is it correct that SGPT is based on GPT-Neo? If so, is it possible to construct an SGPT that is based on GPT-2?
How would I construct it from scratch?

Thank you

@Muennighoff
Owner

Hey, yes that's correct.

If you want to create SGPT as a Bi-Encoder for embeddings (SGPT-BE), then you need to finetune GPT-2 using the code in this repo.

If you want to create SGPT as a Cross-Encoder for reranking, then you just need to swap out GPT-Neo for GPT-2 in the inference script.
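
For context, here is a minimal sketch of the pooling step a Bi-Encoder would apply to GPT-2's last hidden states, assuming the position-weighted mean pooling described in the SGPT paper. The function names and the toy vectors are illustrative assumptions, not code from this repo; in practice the token vectors would come from the finetuned model.

```python
import math
from typing import List


def weighted_mean_pool(hidden_states: List[List[float]],
                       attention_mask: List[int]) -> List[float]:
    """Position-weighted mean pooling over one sequence's token vectors.

    hidden_states: one vector per token (hypothetical stand-in for the
                   model's last hidden states).
    attention_mask: 1 for real tokens, 0 for padding.
    """
    # Later tokens get linearly larger weights (1, 2, 3, ...); padding gets 0.
    weights = [(pos + 1) * mask for pos, mask in enumerate(attention_mask)]
    total = sum(weights)
    if total == 0:
        raise ValueError("attention_mask selects no tokens")
    dim = len(hidden_states[0])
    return [
        sum(w * vec[d] for w, vec in zip(weights, hidden_states)) / total
        for d in range(dim)
    ]


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two pooled embeddings."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


# Toy example: three token vectors, the last one is padding.
tokens = [[1.0, 0.0], [0.0, 1.0], [9.0, 9.0]]
mask = [1, 1, 0]
embedding = weighted_mean_pool(tokens, mask)
```

The weighting reflects that, in a causal model like GPT-2, later tokens have attended to more of the sequence than earlier ones. Two texts are then compared by the cosine similarity of their pooled embeddings.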

@tsamarahrana
Author

Thank you for the fast reply.

I want to create SGPT as a Bi-Encoder. Could you tell me exactly which code to use?

@Muennighoff
Owner

Sure, there are instructions in the README: https://github.com/Muennighoff/sgpt/tree/main/biencoder/nli_msmarco

Let me know if something is unclear!

@tsamarahrana
Author

Okay I’ll see how it goes, thank you for the guide!
