ADR for Model-as-a-Service #6693

Merged: 7 commits, Jun 20, 2024

@TaoChenOSU commented Jun 12, 2024

Motivation and Context

Related to: #6530, #3992

Description

Contribution Checklist

@TaoChenOSU added the "PR: in progress", "ai connector", and "documentation" labels Jun 12, 2024
@TaoChenOSU self-assigned this Jun 12, 2024
Four review threads on docs/decisions/0046-model-as-a-service.md (outdated, resolved)
@TaoChenOSU changed the title from "WIP: ADR for Model-as-a-Service" to "ADR for Model-as-a-Service" Jun 18, 2024
@moonbox3 left a comment

LGTM

@TaoChenOSU added this pull request to the merge queue Jun 20, 2024
Merged via the queue into microsoft:main with commit e474fec Jun 20, 2024
12 checks passed
@TaoChenOSU deleted the taochen/adr-maas branch June 20, 2024 18:06
@TaoChenOSU linked an issue Jun 20, 2024 that may be closed by this pull request
@KSemenenko commented

Is this available for C#?

github-merge-queue bot pushed a commit that referenced this pull request Jun 24, 2024
### Motivation and Context

Related to: #6693

Azure Model-as-a-Service lets users deploy certain models from the
Azure AI Studio model catalog as an API, with pay-as-you-go access to
the hosted models. Supported models include:
- Microsoft Phi-3 family
- Meta Llama family (Llama 2 chat & Llama 3 instruct)
- Mistral-Small & Mistral-Large

We'd like to provide an AI connector so that SK users can use Azure
Model-as-a-Service.

### Description

A new AI connector named `azure_ai_inference` is added to support Azure
Model-as-a-Service. This connector takes a new dependency on the Python
`azure.ai.inference` SDK.
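
For readers unfamiliar with the underlying SDK, here is a minimal sketch of calling a Model-as-a-Service deployment directly through `azure.ai.inference`. It does not show the `azure_ai_inference` connector added in this commit, whose interface is not described here. The environment variable names and the `load_settings` and `ask` helpers are assumptions made for illustration, and the endpoint and key must come from your own deployment.

```python
from typing import Mapping, Tuple

# Hypothetical environment variable names, chosen for this sketch only.
ENDPOINT_VAR = "AZURE_AI_INFERENCE_ENDPOINT"
API_KEY_VAR = "AZURE_AI_INFERENCE_API_KEY"


def load_settings(env: Mapping[str, str]) -> Tuple[str, str]:
    """Read the endpoint URL and API key from a mapping such as os.environ."""
    missing = [name for name in (ENDPOINT_VAR, API_KEY_VAR) if not env.get(name)]
    if missing:
        raise ValueError(f"Missing settings: {', '.join(missing)}")
    return env[ENDPOINT_VAR], env[API_KEY_VAR]


def ask(endpoint: str, api_key: str, question: str) -> str:
    """Send one user message to the deployed model and return the reply text."""
    # Imported lazily so this module loads even without the SDK installed.
    from azure.ai.inference import ChatCompletionsClient
    from azure.ai.inference.models import SystemMessage, UserMessage
    from azure.core.credentials import AzureKeyCredential

    client = ChatCompletionsClient(
        endpoint=endpoint,
        credential=AzureKeyCredential(api_key),
    )
    response = client.complete(
        messages=[
            SystemMessage(content="You are a helpful assistant."),
            UserMessage(content=question),
        ]
    )
    return response.choices[0].message.content
```

With the SDK installed and both variables set, `ask(*load_settings(os.environ), "Hello")` would send a single chat request to the deployed model.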

### Contribution Checklist

- [X] The code builds clean without any errors or warnings
- [X] The PR follows the [SK Contribution
Guidelines](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md)
and the [pre-submission formatting
script](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md#development-scripts)
raises no violations
- [X] All unit tests pass, and I have added new tests where possible
- [X] I didn't break anyone 😄

---------

Co-authored-by: Evan Mattson <35585003+moonbox3@users.noreply.github.com>
LudoCorporateShark pushed a commit to LudoCorporateShark/semantic-kernel that referenced this pull request Aug 25, 2024
### Motivation and Context

Related to: microsoft#6530,
microsoft#3992

### Description

### Contribution Checklist

- [ ] The code builds clean without any errors or warnings
- [ ] The PR follows the [SK Contribution
Guidelines](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md)
and the [pre-submission formatting
script](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md#development-scripts)
raises no violations
- [ ] All unit tests pass, and I have added new tests where possible
- [ ] I didn't break anyone 😄
Labels: ai connector, documentation, PR: in progress
Projects: Archived in project
Development: Successfully merging this pull request may close this issue: Python: Add AI connector for Azure MaaS (Including ADR)
6 participants