
artificertxj1/model_deploy_with_fastapi_and_tritonserver


An example of deploying a model using FastAPI and TritonServer

FastAPI: https://fastapi.tiangolo.com

TritonServer: https://github.com/triton-inference-server/server

To start the app, run:

bash build_and_run_app.sh run

To stop the app, run:

bash build_and_run_app.sh stop

Once the app is running:

To check the model and Triton server status:

curl localhost:8080/health
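
For reference, here is a minimal sketch of what could back this route, assuming Triton's gRPC endpoint is on its default port 8001 and a hypothetical model name (the repo's actual handler may differ):

    from fastapi import FastAPI
    import tritonclient.grpc as grpcclient

    app = FastAPI()

    @app.get("/health")
    def health():
        # assumption: Triton's gRPC endpoint is on its default port 8001
        client = grpcclient.InferenceServerClient(url="localhost:8001")
        return {
            "server_live": client.is_server_live(),
            "model_ready": client.is_model_ready("image_classifier"),  # hypothetical name
        }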

To classify an image:

curl --header "Content-Type: application/json" \
       --request POST \
       --data '{"img_ID": "<KEY_ID_OF_THE_IMAGE>", "img_Path": "<IMAGE_SAVE_PATH>"}' \
        localhost:8080/predict
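
For orientation, the handler behind /predict could look roughly like the sketch below; the tensor names (input__0, output__0), the 224x224 preprocessing, and the model name image_classifier are assumptions used to illustrate the FastAPI-to-Triton hop, not the repo's exact code.

    from fastapi import FastAPI
    from pydantic import BaseModel
    import numpy as np
    from PIL import Image
    import tritonclient.grpc as grpcclient

    app = FastAPI()

    class PredictRequest(BaseModel):
        img_ID: str
        img_Path: str

    @app.post("/predict")
    def predict(req: PredictRequest):
        # assumption: a 224x224 RGB classifier; match names/shapes to config.pbtxt
        img = Image.open(req.img_Path).convert("RGB").resize((224, 224))
        batch = np.asarray(img, dtype=np.float32)[None].transpose(0, 3, 1, 2) / 255.0
        client = grpcclient.InferenceServerClient(url="localhost:8001")
        inp = grpcclient.InferInput("input__0", list(batch.shape), "FP32")
        inp.set_data_from_numpy(batch)
        out = grpcclient.InferRequestedOutput("output__0")
        result = client.infer(model_name="image_classifier", inputs=[inp], outputs=[out])
        scores = result.as_numpy("output__0")
        return {"img_ID": req.img_ID, "class_id": int(scores.argmax())}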

Image file reading and storage have a directory dependency (/home/ubuntu/image_data). If your image data lives in another local directory, change line 56 in build_and_run_app.sh from "-v /home/ubuntu/image_data:/image_data" to "-v <YOUR_IMAGE_DIR>:/image_data". The API container bind-mounts this host directory to the container volume /image_data, so when you send a request to port 8080, img_Path should be "/image_data/<NAME_OF_IMAGE>".
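
As a usage sketch, after dropping an image into the mounted directory (the ID and file name below are hypothetical):

    import requests

    resp = requests.post(
        "http://localhost:8080/predict",
        json={"img_ID": "42", "img_Path": "/image_data/cat.jpg"},  # hypothetical values
    )
    print(resp.status_code, resp.json())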

Save a TorchScript model (or a model from any other popular framework: TensorFlow, ONNX, Caffe, MXNet, etc.) in the following layout:

<model_repo>
   |
   |____<name_of_model>
               |
               |____config.pbtxt
               |
               |____1
               |      |____model_version_1.pt
               |
               |____2
               |      |____model_version_2.pt
               |
               .
               .
               .
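
For example, a TorchScript model could be exported into this layout roughly as follows (the model and directory names are illustrative). Note that Triton's PyTorch backend looks for model.pt inside each version directory by default, so a custom file name like model_version_1.pt needs default_model_filename set in config.pbtxt.

    import os
    import torch
    import torchvision

    # assumption: an off-the-shelf classifier stands in for the repo's actual model
    model = torchvision.models.resnet18(weights=None).eval()
    traced = torch.jit.trace(model, torch.rand(1, 3, 224, 224))

    os.makedirs("model_repo/image_classifier/1", exist_ok=True)
    traced.save("model_repo/image_classifier/1/model_version_1.pt")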

This repo demonstrates an example of using TritonServer with the gRPC protocol.
