This repository was archived by the owner on Jul 4, 2025. It is now read-only.
fix: check model status before inferencing #1864
Merged
Describe Your Changes
This pull request introduces several changes to the `InferenceService` class and related methods to improve model handling and logging. The most important changes include adding logic to handle model loading, updating logging levels, and modifying the storage of saved models.

Improvements to model handling and storage:
- `engine/services/inference_service.cc`: Added logic to check whether a model is loaded and to start loading it if not. This includes retrieving the model status and initiating the model loading process if necessary.
- `engine/services/inference_service.cc`: Removed redundant code for retrieving the `model_id` inside a nested block and consolidated it at the beginning of the function.
- `engine/services/inference_service.cc`: Added logic to save models in the `saved_models_` map when they are loaded, ensuring they can be reused later.
- `engine/services/inference_service.h`: Introduced a new `SavedModel` type and an `unordered_map` to store saved models, facilitating efficient model retrieval.

Logging improvements:
- `engine/services/inference_service.cc`: Changed the logging level from `CTL_INF` to `CTL_DBG` for the JSON-body inference message to reduce log verbosity.

Fixes Issues
Self Checklist