Skip to content

Model server for language model based on Huggingface pipeline.

License

Notifications You must be signed in to change notification settings

dreamingbird88/model_server

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Model server for HuggingFace models.

Setup steps

  1. Build docker image
docker_image_name=ml_model_server
user_name=some_user
docker build \
  -t ${docker_image_name} \
  --build-arg ML_USER_NAME=${user_name} \
  -f Dockerfile . 
  1. Start docker container
port=7860
host_dir=/mnt/new_data/huggingface_models
container_dir=/models
docker run  -it --rm -e PORT=${port} \
  --volume "${host_dir}":$container_dir \
  ${docker_image_name}

Notes

Additional Info

  • Install dependencies via pip.
    pip install --no-cache-dir --upgrade -r requirements.txt
  • Start server without docker. It can use for simple debug.
    export MODEL_PATH=/mnt/new_data/huggingface_models # directory stores models
    uvicorn main:app --host 0.0.0.0 --port 7860
  • Get universal link from "remote.moe". It can be used in demos.
    ssh -R 8000:localhost:7860 remote.moe

About

Model server for language model based on Huggingface pipeline.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published