issues Search Results · repo:OpenCSGs/llm-inference language:Python
Filter by
34 results
(69 ms)34 results
inOpenCSGs/llm-inference (press backspace or delete to remove)for example, ray cluster just has 12 cpus.
curl -H Content-Type: application/json -H user-name: default -d [{ model_id : facebook/opt-125m , model_task : text-generation , model_revision : ...
SeanHH86
- Opened on May 8, 2024
- #134
SeanHH86
- 1
- Opened on May 7, 2024
- #132
1:job_id:04000000
:actor_name:ServeReplica:default:opencsg--csg-wukong-1B
[INFO 2024-04-30 03:46:04,636] __init__.py: 14 Import vllm related stuff failed, please make sure vllm is installed.
INFO 2024-04-30 ...
SeanHH86
- 1
- Opened on Apr 30, 2024
- #128
depenglee1707
- Opened on Apr 24, 2024
- #123
initialization:
runtime_env:
env_vars:
HF_ENDPOINT: https://hub.opencsg.com/hf
initializer:
type: Vllm
from_pretrained_kwargs:
trust_remote_code: true
pipeline: ...
depenglee1707
- 2
- Opened on Apr 23, 2024
- #120
depenglee1707
- Opened on Apr 23, 2024
- #117
depenglee1707
- Opened on Apr 23, 2024
- #116
for now the generation params is addressed in yaml files, add the ability reset these params on fly is useful:
generate_kwargs:
do_sample: false
max_new_tokens: 512
min_new_tokens: ...
enhancement
depenglee1707
- Opened on Apr 17, 2024
- #104

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.
Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.