[FSTORE-1820] Documentation for REST API model deployments #504
Conversation
The URL follows this format:

```text
http://<ISTIO_GATEWAY_IP>/v1/models/<DEPLOYMENT_NAME>:predict
```
This is not 100% correct. The URL format depends on the model server. For example, `/v1/models/<DEPLOYMENT_NAME>:predict` works for Python deployments, but not for TensorFlow or LLMs.

I would suggest something like:

`http://<ISTIO_GATEWAY_IP>/<RESOURCE_PATH>`, where `RESOURCE_PATH` depends on the model server (e.g., vLLM, TensorFlow Serving, KServe sklearnserver).
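For illustration, here is a minimal sketch of calling a Python (sklearn-style) deployment through the Istio gateway, assuming the v1 `:predict` path discussed above. The gateway IP, deployment name, and `Host` header value are hypothetical placeholders, and the resource path would differ for other model servers.

```python
# Minimal sketch: request to a Python (sklearn-style) deployment via the
# Istio gateway, using the KServe v1 ":predict" path discussed above.
# ISTIO_GATEWAY_IP, DEPLOYMENT_NAME, and the Host header value are
# hypothetical placeholders; other model servers (vLLM, TensorFlow
# Serving) expose different resource paths.
import requests

ISTIO_GATEWAY_IP = "192.168.1.10"  # hypothetical gateway address
DEPLOYMENT_NAME = "mymodel"        # hypothetical deployment name

response = requests.post(
    f"http://{ISTIO_GATEWAY_IP}/v1/models/{DEPLOYMENT_NAME}:predict",
    json={"instances": [[1.0, 2.0, 3.0, 4.0]]},  # v1 protocol request body
    # Istio typically routes on the Host header; this value is a guess at
    # the deployment's virtual host, not a documented format.
    headers={"Host": f"{DEPLOYMENT_NAME}.example.hopsworks.ai"},
    timeout=10,
)
response.raise_for_status()
print(response.json())
```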
I updated it based on your suggestion.
## Example Response

The model returns predictions in a JSON object. You can find more information [here](https://kserve.github.io/website/docs/concepts/architecture/data-plane/v1-protocol#response-format).
The model server responses also depend on the model server implementation :)

The `{ "predictions": [] }` format applies to sklearn/xgboost deployments, but TensorFlow Serving or vLLM returns a different format than the one specified in the link.
Aah yes, I see your point. I removed the example I had and updated the text to say that the response format also depends on the model server.
I could not find a link to their Model Serving page, so I pointed to the KServe docs and mentioned that readers can refer there for more information about specific model servers.
Co-authored-by: Javier de la Rúa Martínez <javierdlrm@outlook.com>
REST API - Hopsworks Documentation.pdf