Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Quickstart setup step 4 (not working) #595

Closed
zaynpatel opened this issue May 21, 2024 Discussed in #235 · 4 comments
Closed

Quickstart setup step 4 (not working) #595

zaynpatel opened this issue May 21, 2024 Discussed in #235 · 4 comments
Assignees

Comments

@zaynpatel
Copy link

Discussed in #235

Originally posted by zaynpatel December 4, 2023
I'm currently running the following command, copied from the documentation with the exception of a new localhost address:

url http://localhost:3000/inferences/llamacpp/loadmodel \
-H 'Content-Type: application/json' \
-d '{
    "llama_model_path": "/model/model/llama-2-7b-model.gguf",
    "ctx_len": 512,
    "ngl": 100
}'

I'm getting a 404 error which references that the inferences/llamacpp/loadmodel is not an available route.

{"url":"/inferences/llamacpp/loadmodel","statusCode":404,"statusMessage":"Cannot find any route matching /inferences/llamacpp/loadmodel.","message":"Cannot find any route matching /inferences/llamacpp/loadmodel."}% 

I'm curious about how to proceed and wonder how I can test what other load model links might be correct?

@fscardua
Copy link

Same problem here.

@0xSage
Copy link
Contributor

0xSage commented Jun 2, 2024

@zaynpatel , @fscardua sorry about this. Cortex is in private preview and has breaking changes daily.

Can you both try the API playground: http://localhost:1337/api
for easy curling?

@louis-jan
Copy link
Contributor

The URL has been changed; now, Cortex will serve the chat/completions endpoint at http://localhost:3000/inferences/server/loadmodel.

@0xSage
Copy link
Contributor

0xSage commented Jul 1, 2024

closing as solved

@0xSage 0xSage closed this as completed Jul 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

No branches or pull requests

5 participants