Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

llava requirements #4

Open
icrto opened this issue Apr 30, 2024 · 3 comments
Open

llava requirements #4

icrto opened this issue Apr 30, 2024 · 3 comments

Comments

@icrto
Copy link

icrto commented Apr 30, 2024

Could you please provide the requirements.txt for llava?

Thanks!

@ys-zong
Copy link
Owner

ys-zong commented Apr 30, 2024

You can install Llava from their original repo: https://github.com/haotian-liu/LLaVA Does that work for you?

@icrto
Copy link
Author

icrto commented May 1, 2024

Yes, that works for me. However, I have not been able to reproduce the results in the paper, and as you mentioned here you were using another version of llava, I thought it might have to do with that, hence my request for the specific package versions you used.

As an example, on Fast Open-Ended MiniImageNet with LLaVA-Next-7B with 2 shots and a detailed description you report (in table 47) an accuracy of 33.67 ± 2.25 while I obtain 14.0.
On Operator Induction with:

  • 0 shots you report 10.56 ± 1.57 while I obtain 16.67
  • 4 shots you report 3.33 ± 2.72 while I obtain 11.67

(This is after I remove the truncation as mentioned in the link.)

@ys-zong
Copy link
Owner

ys-zong commented May 31, 2024

Sorry for the late reply. I just re-run Llava from their latest code and I can reproduce the reported accuracies with marginal difference. I don't have a very clear idea of why there is a huge differences. I'll aim to refactor Llava-next to the Huggingface implementation soon for a more stable reproduction.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants