Thanks! I've clarified the installation instructions in the README. The general outline is to clone the repo, install the dependencies, and run the example inference code in the README. Unfortunately, the Llama code requires flash-attention (and there seems to be a performance gap between training the model with flash-attention and running inference without it). The OPT AutoCompressor does not use flash-attention by default.
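For reference, the outline above might look like the following shell sketch. The repository URL, requirements file, and script name are placeholders, not taken from the README, so substitute the actual values from the repo:

```shell
# Hypothetical setup sketch -- the repo URL and file names below are
# assumptions; use the ones given in the README.
git clone https://github.com/example/AutoCompressors.git
cd AutoCompressors
pip install -r requirements.txt

# flash-attention is needed for the Llama models; the OPT
# AutoCompressor runs without it by default.
pip install flash-attn --no-build-isolation

# Run the example inference code from the README (script name assumed).
python run_inference_example.py
```

Note that flash-attn builds CUDA kernels, so installation requires a matching CUDA toolkit and can take a while to compile.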
Hi,
I found the following missing from the install instructions: