
[Feature] Support for Azure AI Studio #779

Merged

Conversation

riedgar-ms
Collaborator

@riedgar-ms riedgar-ms commented Apr 26, 2024

Add support for models deployed in Azure AI Studio. This has been done by combining the code for OpenAI models with the sample code provided by Azure AI Studio.

Since there are a bunch of common test cases which need to run with multiple models, start refactoring those a bit as well (and hook the Azure OpenAI tests into this). This isn't using the same mechanism as the testing of local models, since we won't be running into trouble with fitting multiple LLMs on a single machine.

@riedgar-ms riedgar-ms requested review from slundberg and Harsha-Nori and removed request for slundberg April 26, 2024 15:06
Collaborator Author

@riedgar-ms riedgar-ms left a comment


Some questions about how best to implement this

def _generator(self, prompt, temperature: float):
# Initial parts of this straight up copied from OpenAIChatEngine

# The next loop (or one like it) appears in several places,
Collaborator Author


Thoughts on this?

This is a straight-up copy of what is in _open_ai.py, but there are almost-but-not-quite-identical versions elsewhere. Chopping up the internal 'guidance state string' into individual chat messages feels like a task that all chat models will need - and hence should be centrally provided. Individual engines can then turn that format into whatever they precisely need.
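
To make the suggestion concrete, here is a minimal sketch of what a shared helper could look like; the function name, the ChatML-style markers, and the message format are assumptions for illustration, not the actual guidance internals:

# Hypothetical shared helper: slice guidance's internal prompt string into
# role-tagged chat messages, so each remote chat engine only has to convert
# this generic list into its own request format. Marker strings are assumed.
ROLE_STARTS = {
    "system": "<|im_start|>system\n",
    "user": "<|im_start|>user\n",
    "assistant": "<|im_start|>assistant\n",
}
ROLE_END = "<|im_end|>\n"

def split_chat_messages(prompt: str) -> list:
    messages = []
    pos = 0
    while pos < len(prompt):
        for role, start in ROLE_STARTS.items():
            if prompt.startswith(start, pos):
                end = prompt.find(ROLE_END, pos + len(start))
                stop = end if end != -1 else len(prompt)
                messages.append({"role": role, "content": prompt[pos + len(start):stop]})
                pos = stop + len(ROLE_END) if end != -1 else len(prompt)
                break
        else:
            pos += 1  # skip any characters outside a recognised role block
    return messages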

def __init__(
self,
*,
tokenizer,
Collaborator Author


I never try setting the tokeniser, and it appears that it eventually defaults to GPT2. I don't quite see why a remote model like this would even need it.

Collaborator Author


OK, so theoretically for token healing. However, I have a feeling that trying to figure out what tokeniser to use will be an exercise in fragility.
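
For context, token healing is roughly the following idea, sketched here with a hypothetical tokenizer object rather than the guidance implementation:

def heal_prompt(tokenizer, prompt: str):
    # Re-tokenize the prompt, drop its final token, and remember that token's
    # text: generation is then constrained so the continuation starts with it.
    # This is why even a remote engine would need some tokenizer on hand.
    token_ids = tokenizer.encode(prompt)
    if not token_ids:
        return prompt, ""
    healed_prompt = tokenizer.decode(token_ids[:-1])
    required_prefix = tokenizer.decode(token_ids[-1:])
    return healed_prompt, required_prefix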

self._reset_shared_data(prompt[:pos], temperature)

# Use cache only when temperature is 0
if temperature == 0:
Collaborator Author


This caching logic is a bit of a concern - at least for models where T=0 doesn't actually get determinism. And in general, it means that a bunch of our tests might not quite be doing what we think they're doing, because they may just be hitting the cache.

Collaborator Author


The more I think about it, the less I like the idea of a disk-based cache. In some ways it's worse on the OpenAI side, where both AzureOpenAI and OpenAI will wind up sharing the same cache.

How much speed up does it really give, compared to the Heisenbug potential it represents?

Collaborator


I think most models have more reliable temp=0 determinism now, but agree that perhaps sharing between AzureOpenAI and OpenAI is problematic (though there shouldn't be differences between the two APIs in theory?). I do think caching is a nice feature to have in general, as production workflows often have shared inputs coming in, where reusing responses saves time and money.

Collaborator Author


I have added a clear_cache argument to the constructor. I think I should add this to the OpenAI side as well, in a separate PR.
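
A rough sketch of the shape such an argument could take; the diskcache usage, attribute names, and cache path here are assumptions rather than the exact PR code:

import pathlib

import diskcache
import platformdirs

class AzureAIStudioChatEngine:
    def __init__(self, *, deployment_name: str, clear_cache: bool = False):
        # One on-disk cache per deployment, so different endpoints do not
        # share entries (path and attribute names are illustrative).
        cache_dir = (
            pathlib.Path(platformdirs.user_cache_dir("guidance"))
            / f"azureai_studio_{deployment_name}"
        )
        self.cache = diskcache.Cache(str(cache_dir))
        if clear_cache:
            # Drop any previously cached responses before first use.
            self.cache.clear()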

req = urllib.request.Request(self._endpoint, body, headers)

response = urllib.request.urlopen(req)
result = json.loads(response.read())
Collaborator Author


The logic around result is another thing which might be model specific.
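
As an illustration of the model-specific part, a hedged sketch of shape-dependent extraction; the field names are assumptions about possible endpoint response formats, not a documented contract:

def extract_chat_content(result: dict) -> str:
    # OpenAI-compatible deployments nest the text under "choices", while some
    # other Azure AI Studio deployments return a flat "output" field.
    if "choices" in result:
        return result["choices"][0]["message"]["content"]
    if "output" in result:
        return result["output"]
    raise ValueError(f"Unrecognised response shape: {sorted(result.keys())}")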

@codecov-commenter

codecov-commenter commented Apr 26, 2024

Codecov Report

Attention: Patch coverage is 22.89157%, with 64 lines in your changes missing coverage. Please review.

Project coverage is 62.05%. Comparing base (8caf911) to head (2327c8a).

Files Patch % Lines
guidance/models/_azureai_studio.py 21.95% 64 Missing ⚠️


Additional details and impacted files
@@            Coverage Diff             @@
##             main     #779      +/-   ##
==========================================
+ Coverage   55.42%   62.05%   +6.63%     
==========================================
  Files          55       56       +1     
  Lines        4076     4159      +83     
==========================================
+ Hits         2259     2581     +322     
+ Misses       1817     1578     -239     


assert isinstance(lm, models.AzureAIStudioChat)
lm.engine.cache.clear()

# No "system" role for Mistral?
Collaborator Author


This makes me unhappy
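
One common workaround, not necessarily what the test settles on, is to fold the system instructions into the first user message when a deployment rejects the "system" role; purely as an illustration:

def fold_system_into_user(messages: list) -> list:
    # Collect "system" content and prepend it to the next "user" message so
    # the request only contains roles the deployment accepts (illustrative).
    folded = []
    pending_system = ""
    for msg in messages:
        if msg["role"] == "system":
            pending_system += msg["content"] + "\n"
        elif msg["role"] == "user" and pending_system:
            folded.append({"role": "user", "content": pending_system + msg["content"]})
            pending_system = ""
        else:
            folded.append(msg)
    return folded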

@@ -58,13 +58,23 @@ jobs:
python -c "import torch; assert torch.cuda.is_available()"
- name: Test with pytest
env:
# Configure endpoints
# Configure endpoints for Azure OpenAI
AZUREAI_CHAT_ENDPOINT: ${{ secrets.AZUREAI_CHAT_ENDPOINT }}
Collaborator Author


We should probably change the 'ENDPOINT' variables here to be vars rather than secrets

AZUREAI_CHAT_ENDPOINT: ${{ secrets.AZUREAI_CHAT_ENDPOINT }}
AZUREAI_CHAT_KEY: ${{ secrets.AZUREAI_CHAT_KEY }}
AZUREAI_CHAT_MODEL: ${{ secrets.AZUREAI_CHAT_MODEL }}
AZUREAI_COMPLETION_ENDPOINT: ${{ secrets.AZUREAI_COMPLETION_ENDPOINT }}
AZUREAI_COMPLETION_KEY: ${{ secrets.AZUREAI_COMPLETION_KEY }}
AZUREAI_COMPLETION_MODEL: ${{ secrets.AZUREAI_COMPLETION_MODEL }}
# Configure endpoints for Azure AI Studio
AZURE_AI_STUDIO_PHI3_ENDPOINT: ${{ vars.AZURE_AI_STUDIO_PHI3_ENDPOINT }}
Collaborator Author


@Harsha-Nori we will need to work together on getting these into the repository

Comment on lines +59 to +61
("system", b"<|im_start|>system\n"),
("user", b"<|im_start|>user\n"),
("assistant", b"<|im_start|>assistant\n"),
Collaborator


Do AzureAI models uniformly use the same role tags across their models? I don't think we can hard code a check for these start_bytes in this class

Collaborator Author


These aren't coming from the model, surely? They're coming from guidance as it sends its current prompt back into this class, to be formatted and forwarded on to the model. Or.... have I managed to miss another action-at-a-distance part of guidance?
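
For reference, those markers are produced on the client side by guidance's role context managers, roughly as below; the constructor keyword arguments are placeholders rather than the real deployment details:

from guidance import assistant, gen, models, system, user

# Placeholder endpoint, deployment, and key (keyword names are assumptions).
lm = models.AzureAIStudioChat(
    azureai_studio_endpoint="https://example.azureml.net/score",
    azureai_studio_deployment="my-deployment",
    azureai_studio_key="...",
)

# Each role block inserts a "<|im_start|>role\n" style marker into guidance's
# prompt state; this engine later slices that state back into chat messages
# before calling the endpoint.
with system():
    lm += "You are a helpful assistant."
with user():
    lm += "What is the capital of France?"
with assistant():
    lm += gen("answer", max_tokens=10)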

Comment on lines 9 to 10
print(f"{dir(lm.engine.tokenizer)=}")
assert hasattr(lm.engine.tokenizer,"sp_model")
Collaborator


We don't want to have this assert for every model -- not every tokenizer is sentencepiece based.
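
A hedged sketch of a more model-agnostic version of that check; the encode/decode interface assumed here is illustrative and may not match the tokenizer wrapper exactly:

def test_tokenizer_roundtrip(lm):
    # Rather than asserting a sentencepiece-specific attribute such as
    # sp_model, exercise a generic encode/decode round trip instead.
    tokenizer = lm.engine.tokenizer
    text = "hello world"
    tokens = tokenizer.encode(text)
    assert tokenizer.decode(tokens) == text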

@riedgar-ms riedgar-ms changed the title [WIP][Feature] Support for Azure AI Studio [Feature] Support for Azure AI Studio May 2, 2024
@riedgar-ms riedgar-ms merged commit 631ff1a into guidance-ai:main May 6, 2024
104 checks passed
@riedgar-ms riedgar-ms deleted the riedgar-ms/azure-ai-studio-support-01 branch May 6, 2024 17:15