Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error from load_quant #25

Open
saikatbhattacharya opened this issue May 9, 2023 · 2 comments
Open

Error from load_quant #25

saikatbhattacharya opened this issue May 9, 2023 · 2 comments
Labels
bug Something isn't working good first issue Good for newcomers help wanted Extra attention is needed

Comments

@saikatbhattacharya
Copy link

saikatbhattacharya commented May 9, 2023

I am using AWS P3 8xLarge instance. I was trying to run your code and getting the following error -

Loading model Models/vicuna-7B-1.1-GPTQ-4bit-128g checkpoint Models/vicuna-7B-1.1-GPTQ-4bit-128g/vicuna-7B-1.1-GPTQ-4bit-128g.safetensors
Loading model ...
Found 3 unique KN Linear values.
Warming up autotune cache ...
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████
Found 1 unique fused mlp KN values.
Warming up autotune cache ...
0%| python3: project/lib/Analysis/Allocation.cpp:42: std::pair<llvm::SmallVector, llvm::SmallVector > mlir::triton::getCvtOrder(const mlir::Attribute&, const mlir::
Aborted

@paolorechia
Copy link
Owner

Hey, it looks like some problem on the triton library.
It’s probable that the gptq-for-llama package listed in the requirements doesn’t support this GPU.
Are you able to load oobagooba’s API in this instance?

@paolorechia
Copy link
Owner

Oobagooba’s code use an older version of the library that has better compatibility with more GPUs and environments

@paolorechia paolorechia added bug Something isn't working help wanted Extra attention is needed good first issue Good for newcomers labels May 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants