Can we support locally deployed private models directly through HTTP, instead of models deployed through frameworks (LocalAI, Ollama, etc.)? #2198

Closed
KateHed opened this issue Jan 25, 2024 · 4 comments
Labels
💪 enhancement New feature or request

Comments


KateHed commented Jan 25, 2024

Self Checks

1. Is this request related to a challenge you're experiencing?

We need support for using local models directly through HTTP requests, rather than relying on a fixed model provider.

2. Describe the feature you'd like to see

We want to use local models directly through HTTP requests.

3. How will this feature improve your workflow or experience?

This feature will help us use our locally deployed models.

4. Additional context or comments

No response

5. Can you help us with this feature?

  • I am interested in contributing to this feature.
KateHed added the 💪 enhancement (New feature or request) label on Jan 25, 2024

dosubot bot commented Jan 25, 2024

🤖

Hello @KateHed! It's great to cross paths with you again in the dify repository. Hope you're having a good day!

Based on the current implementation of dify (v0.5.0), it does support the integration of local models through HTTP requests. This is indicated by the presence of 'localai' in the list of model providers in the _position.yaml file. This suggests that dify can interact with a local AI model, likely through HTTP requests. You can find more details in the api/core/model_runtime/model_providers/_position.yaml file.

Dify integrates with local models through HTTP requests by providing a set of APIs that allow users to interact with the models. These APIs include functionalities like getting a list of model providers, getting the credentials for a specific model provider, validating the credentials for a specific model provider, saving or deleting the credentials for a specific model provider, and more. You can find more details in the api/controllers/console/workspace/model_providers.py file.

The process of integrating a local model through HTTP requests in dify involves defining the model provider and configuration method, generating the form schemas based on the provider and configuration method, validating the form values, saving or removing the credentials, and rendering the form. You can find more details in the web/app/components/header/account-setting/model-provider-page/model-modal/index.tsx file.

Please note that the actual HTTP requests are likely made in the saveCredentials and removeCredentials utility functions, which are not included in the provided context. Therefore, the exact details of the HTTP requests (such as the request method, URL, and body) are not available.

I hope this information helps! If you have any other questions or need further clarification, feel free to ask.

Best regards,
Dosu

This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.


takatost (Collaborator) commented

We're currently offering an OpenAI compatible API for direct integration with your personally developed model inference API.
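
For readers landing here: the "OpenAI-API-compatible" route means pointing Dify at any HTTP endpoint that speaks the OpenAI chat-completions protocol. As a hedged sketch (the base URL, API key, and model name below are placeholders, not values from this thread), such an endpoint is expected to accept and return roughly this shape:

```python
# Minimal illustration of the OpenAI-style chat-completions contract that an
# OpenAI-API-compatible provider expects. The URL, API key, and model name
# are placeholders for a hypothetical locally deployed service.
import requests

BASE_URL = "http://localhost:8000/v1"  # hypothetical local endpoint
API_KEY = "not-needed-for-local"       # many local servers ignore the key

resp = requests.post(
    f"{BASE_URL}/chat/completions",
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "model": "my-private-model",   # placeholder model identifier
        "messages": [{"role": "user", "content": "Hello"}],
    },
    timeout=60,
)
resp.raise_for_status()
# An OpenAI-compatible server responds with choices[0].message.content.
print(resp.json()["choices"][0]["message"]["content"])
```

If a local server answers this request with the standard OpenAI response format, it can be registered under the OpenAI-API-compatible provider.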

KateHed (Author) commented Jan 25, 2024

> We're currently offering an OpenAI compatible API for direct integration with your personally developed model inference API.

Our models are not compatible with OpenAI's interface, so they cannot be called directly through the OpenAI-API-compatible provider.
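
One possible workaround (not proposed in this thread, just a sketch under stated assumptions) is to place a thin translation layer in front of the private model so that it exposes the OpenAI chat-completions protocol. The upstream URL (`http://localhost:9000/generate`) and its `{"prompt": ...}` / `{"text": ...}` payload are hypothetical stand-ins for the private model's real API:

```python
# Hedged sketch: a thin FastAPI shim that exposes an OpenAI-style
# /v1/chat/completions route and forwards requests to a hypothetical private
# model served at http://localhost:9000/generate.
# Run with e.g.: uvicorn shim:app --port 8000  (filename "shim.py" assumed)
import time
import uuid

import httpx
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()


class Message(BaseModel):
    role: str
    content: str


class ChatRequest(BaseModel):
    model: str
    messages: list[Message]


@app.post("/v1/chat/completions")
async def chat_completions(req: ChatRequest):
    # Flatten the chat history into whatever format the private model expects.
    prompt = "\n".join(f"{m.role}: {m.content}" for m in req.messages)
    async with httpx.AsyncClient() as client:
        upstream = await client.post(
            "http://localhost:9000/generate",  # placeholder private endpoint
            json={"prompt": prompt},
            timeout=120,
        )
    upstream.raise_for_status()
    text = upstream.json().get("text", "")     # assumed response field
    # Return the minimal OpenAI-compatible response shape.
    return {
        "id": f"chatcmpl-{uuid.uuid4().hex}",
        "object": "chat.completion",
        "created": int(time.time()),
        "model": req.model,
        "choices": [{
            "index": 0,
            "message": {"role": "assistant", "content": text},
            "finish_reason": "stop",
        }],
        "usage": {"prompt_tokens": 0, "completion_tokens": 0, "total_tokens": 0},
    }
```

Registering the shim's base URL (e.g. `http://localhost:8000/v1`) as an OpenAI-API-compatible provider should, in principle, let Dify talk to the private model without any Dify-side changes.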

crazywoola (Member) commented Jan 25, 2024

Technically, it's feasible to implement, but we can't implement each provider as requests come along, since there are many and it's quite time-consuming. Therefore, developers can refer to the model_runtime section and integrate it on their own. We need to limit edge cases so we can focus on developing our main features.
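
For orientation only, here is a rough, Dify-agnostic illustration of the "integrate it yourself" direction: a small client that forwards chat messages to a private HTTP endpoint, which a custom provider could delegate to. The class, method, URL, and JSON fields below are placeholders and do not reflect Dify's actual model_runtime interfaces:

```python
# Hedged sketch of a client-side wrapper for a hypothetical private model API.
# The endpoint URL and JSON fields are placeholders, not part of Dify.
import requests


class PrivateModelClient:
    """Minimal HTTP client for a hypothetical locally deployed model."""

    def __init__(self, base_url: str = "http://localhost:9000"):
        self.base_url = base_url.rstrip("/")

    def invoke(self, messages: list[dict]) -> str:
        # Convert chat messages into the private API's own request format.
        prompt = "\n".join(f"{m['role']}: {m['content']}" for m in messages)
        resp = requests.post(
            f"{self.base_url}/generate",
            json={"prompt": prompt},
            timeout=60,
        )
        resp.raise_for_status()
        return resp.json().get("text", "")  # assumed response field


if __name__ == "__main__":
    client = PrivateModelClient()
    print(client.invoke([{"role": "user", "content": "Hello"}]))
```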

crazywoola closed this as not planned on Jan 25, 2024