Skip to content

Commit

Permalink
Add missing docs images
Browse files Browse the repository at this point in the history
  • Loading branch information
pseudotensor committed Jun 13, 2024
1 parent 0c0a5c7 commit d3f7396
Show file tree
Hide file tree
Showing 5 changed files with 3 additions and 1 deletion.
Binary file added docs/ollama_max_seq_len.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/ollama_setup.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/ollama_use.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 2 additions & 0 deletions src/gen.py
Original file line number Diff line number Diff line change
Expand Up @@ -655,6 +655,8 @@ def main(
vllm_chat:https://vllm.h2o.ai:5001:/1b1219f7-4bb4-43e9-881f-fa8fa9fe6e04/v1:1234ABCD
where vllm.h2o.ai is the DNS name of the IP, 5001 is the port, /1b1219f7-4bb4-43e9-881f-fa8fa9fe6e04/v1 is the url of the "page" to access, and 1234ABCD is the api key
If you have any other OpenAI compatible chat completion endpoint, you should use vllm_chat way. E.g. llama.cpp http server: https://github.com/ggerganov/llama.cpp/tree/master/examples/server
For sglang, text models are supported via OpenAI API and can use vllm_chat or vllm as usual.
For sglang and vision models, need to specify sglang so we use http requests API via generate endpoint. Use "sglang" prefix and otherwise it is like vllm endpoint
Currently it's not clear how to make an API key work: https://github.com/sgl-project/sglang/issues/466, so one should rely upon firewalls
Expand Down
2 changes: 1 addition & 1 deletion src/version.py
Original file line number Diff line number Diff line change
@@ -1 +1 @@
__version__ = "14ae688c2e44ac1062fe7cf70b4d97a41fa45f0a"
__version__ = "0c0a5c7648c203f578a28a05b9b7761619334294"

0 comments on commit d3f7396

Please sign in to comment.