
feat: integrate Azure OpenAI support #221


Open · wants to merge 3 commits into base: main

Conversation

mmlmt2604

Client implementation and documentation updates

- Added AzureAIClient for interacting with Azure OpenAI models.
- Updated README.md to include Azure OpenAI API key, endpoint, and version setup instructions.
- Modified configuration to support Azure as a model provider.
- Enhanced chat completion handling to include Azure API calls in websocket and streaming contexts.
- Updated generator configuration to define Azure models and their parameters.


@gemini-code-assist (bot) left a comment


Summary of Changes

Hello @mmlmt2604, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces comprehensive support for Azure OpenAI models across the application. It includes the development of a new client for interacting with Azure's API, updates to the core configuration to register Azure as a model provider, and integration into both the standard chat completion and real-time websocket functionalities. Additionally, the documentation has been thoroughly updated to guide users through the setup and usage of Azure OpenAI.

Highlights

  • New Azure OpenAI Client: A dedicated AzureAIClient has been introduced to facilitate interactions with Azure OpenAI models, supporting both API key and Azure AD token authentication for chat completions and embeddings.
  • Configuration Integration: The system's configuration (api/config.py and api/config/generator.json) has been updated to recognize Azure as a new model provider, defining its client class and supported models (e.g., gpt-4o, gpt-4, gpt-35-turbo) along with their default parameters.
  • API and WebSocket Support: Azure OpenAI models are now integrated into the simple_chat and websocket_wiki APIs, enabling chat completions, including streaming responses, and incorporating robust error handling for Azure API calls.
  • Comprehensive Documentation: The README.md has been extensively updated with instructions for setting up Azure OpenAI environment variables, obtaining credentials, and troubleshooting common API errors, along with updates to diagrams and model provider lists.
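The environment variables documented in the README changes plug directly into the official `openai` Python SDK's `AzureOpenAI` class. The helper below is an illustrative sketch only — the function name and error handling are not part of the PR:

```python
import os

def azure_client_kwargs(env) -> dict:
    """Collect the Azure OpenAI settings documented in the README changes
    into the kwargs expected by openai.AzureOpenAI(...).
    (Helper name and validation are illustrative, not part of the PR.)"""
    required = ("AZURE_OPENAI_API_KEY", "AZURE_OPENAI_ENDPOINT", "AZURE_OPENAI_VERSION")
    missing = [k for k in required if not env.get(k)]
    if missing:
        raise KeyError(f"Missing Azure OpenAI settings: {missing}")
    return {
        "api_key": env["AZURE_OPENAI_API_KEY"],
        "azure_endpoint": env["AZURE_OPENAI_ENDPOINT"],
        "api_version": env["AZURE_OPENAI_VERSION"],
    }

# Usage (requires real credentials):
# from openai import AzureOpenAI
# client = AzureOpenAI(**azure_client_kwargs(os.environ))
```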

@gemini-code-assist (bot) left a comment

Code Review

This pull request integrates Azure OpenAI, including a new AzureAIClient, updates to configuration, documentation, and integration into the chat completion workflows. The README has been updated with setup instructions. The AzureAIClient implementation is functional but has a few areas for improvement regarding input processing, response parsing, state management, and coding conventions.

Comment on lines +303 to +317

```python
def parse_chat_completion(
    self,
    completion: Union[ChatCompletion, Generator[ChatCompletionChunk, None, None]],
) -> "GeneratorOutput":
    """Parse the completion, and put it into the raw_response."""
    log.debug(f"completion: {completion}, parser: {self.chat_completion_parser}")
    try:
        data = self.chat_completion_parser(completion)
        usage = self.track_completion_usage(completion)
        return GeneratorOutput(
            data=None, error=None, raw_response=data, usage=usage
        )
    except Exception as e:
        log.error(f"Error parsing the completion: {e}")
        return GeneratorOutput(data=None, error=str(e), raw_response=completion)
```


high

  1. The GeneratorOutput is returned with data=None, while the parsed data is put into raw_response. The parsed data should be in GeneratorOutput.data.
  2. If track_completion_usage is called with a streaming generator, it raises NotImplementedError. This error is then caught here, and GeneratorOutput is returned with an error message. This means that for streaming responses, the GeneratorOutput will always indicate an error related to usage tracking and won't contain the actual streamed data in the data field.
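A minimal sketch of how both points could be addressed, with simplified stand-ins for `GeneratorOutput` and the parser/usage callables: the parsed payload goes into `data`, and usage tracking is skipped for streaming generators instead of surfacing as a spurious error:

```python
import logging
from dataclasses import dataclass
from typing import Any, Optional

log = logging.getLogger(__name__)

@dataclass
class GeneratorOutput:
    # Simplified stand-in for the real GeneratorOutput dataclass.
    data: Optional[Any] = None
    error: Optional[str] = None
    raw_response: Optional[Any] = None
    usage: Optional[Any] = None

def parse_chat_completion(completion, parser, track_usage):
    """Put the parsed payload into `data` (not only `raw_response`),
    and skip usage tracking for streaming generators rather than letting
    NotImplementedError turn into an error output."""
    try:
        data = parser(completion)
    except Exception as e:
        log.error(f"Error parsing the completion: {e}")
        return GeneratorOutput(data=None, error=str(e), raw_response=completion)
    usage = None
    if not hasattr(completion, "__next__"):  # non-streaming completions only
        try:
            usage = track_usage(completion)
        except NotImplementedError:
            log.debug("Usage tracking unavailable for this completion type.")
    return GeneratorOutput(data=data, error=None, raw_response=completion, usage=usage)
```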

Comment on lines +371 to +394

```python
if self._input_type == "messages":
    system_start_tag = "<START_OF_SYSTEM_PROMPT>"
    system_end_tag = "<END_OF_SYSTEM_PROMPT>"
    user_start_tag = "<START_OF_USER_PROMPT>"
    user_end_tag = "<END_OF_USER_PROMPT>"
    pattern = f"{system_start_tag}(.*?){system_end_tag}{user_start_tag}(.*?){user_end_tag}"
    # Compile the regular expression
    regex = re.compile(pattern)
    # Match the pattern
    match = regex.search(input)
    system_prompt, input_str = None, None

    if match:
        system_prompt = match.group(1)
        input_str = match.group(2)
    else:
        print("No match found.")
    if system_prompt and input_str:
        messages.append({"role": "system", "content": system_prompt})
        messages.append({"role": "user", "content": input_str})
if len(messages) == 0:
    messages.append({"role": "system", "content": input})
final_model_kwargs["messages"] = messages
```


high

The logic for handling input_type seems to incorrectly treat default text input as a system message. Consider revising this to correctly assign roles based on _input_type and handle regex mismatches more explicitly for the "messages" type.
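One possible revision, sketched as a standalone helper (the function name is illustrative): tagged input is split into system/user roles, and anything else falls back to a plain user message rather than a system message:

```python
import logging
import re

log = logging.getLogger(__name__)

def build_messages(input: str, input_type: str) -> list:
    """Assign roles based on input_type: parse the tagged prompt when
    input_type is 'messages', and fall back to a *user* message (not a
    system message) when the tags are absent or input_type is 'text'."""
    if input_type == "messages":
        pattern = (r"<START_OF_SYSTEM_PROMPT>(.*?)<END_OF_SYSTEM_PROMPT>"
                   r"<START_OF_USER_PROMPT>(.*?)<END_OF_USER_PROMPT>")
        match = re.search(pattern, input, re.DOTALL)
        if match:
            return [
                {"role": "system", "content": match.group(1)},
                {"role": "user", "content": match.group(2)},
            ]
        log.warning("Prompt tags not found; treating input as a user message.")
    return [{"role": "user", "content": input}]
```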

```diff
@@ -365,6 +386,9 @@ You can also mount a .env file to the container:
 echo "GOOGLE_API_KEY=your_google_api_key" > .env
 echo "OPENAI_API_KEY=your_openai_api_key" >> .env
 echo "OPENROUTER_API_KEY=your_openrouter_api_key" >> .env
+echo "AZURE_OPENAI_API_KEY=your_azure_openai_api_key" >> .env
+echo "AZURE_OPENAI_ENDPOINT=your_azure_openai_endpoint" >> .env
+echo "AZURE_OPENAI_VERSION=your_azure_openai_version" >>.env
```


medium

Typo in the echo command. It should be >> .env to correctly append to the .env file.

Suggested change

```diff
-echo "AZURE_OPENAI_VERSION=your_azure_openai_version" >>.env
+echo "AZURE_OPENAI_VERSION=your_azure_openai_version" >> .env
```


```python
__all__ = ["AzureAIClient"]

# TODO: this overlaps with openai client largely, might need to refactor to subclass openai client to simplify the code
```


medium

Consider refactoring to reduce code duplication with OpenAIClient. A common base class would improve maintainability.
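A rough sketch of that refactor, with a stand-in base class (the real change would subclass the project's `OpenAIClient`): only client construction is overridden, so parsing, usage tracking, and input conversion stay in one place:

```python
import os

class OpenAIClientBase:
    """Stand-in for the project's OpenAIClient; in the real refactor,
    AzureAIClient would subclass that class directly."""
    def init_sync_client(self):
        raise NotImplementedError

class AzureAIClient(OpenAIClientBase):
    def __init__(self, api_key=None, api_version=None, azure_endpoint=None):
        self._api_key = api_key or os.getenv("AZURE_OPENAI_API_KEY")
        self._api_version = api_version or os.getenv("AZURE_OPENAI_VERSION")
        self._azure_endpoint = azure_endpoint or os.getenv("AZURE_OPENAI_ENDPOINT")

    def init_sync_client(self):
        # Only client construction differs from the OpenAI base class;
        # everything else is inherited.
        from openai import AzureOpenAI  # imported lazily for illustration
        return AzureOpenAI(api_key=self._api_key,
                           api_version=self._api_version,
                           azure_endpoint=self._azure_endpoint)
```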

```python
log_probs = []
for c in completion.choices:
    content = c.logprobs.content
    print(content)
```


medium

Remove this print statement or replace it with log.debug().

```python
log.debug(f"Logprobs content: {content}")  # Or remove if not needed
```

```python
# added api_type azure for azure Ai
self.api_type = "azure"
self._api_key = api_key
self._apiversion = api_version
```


medium

Variable name _apiversion should be _api_version to adhere to PEP 8.

```python
self._api_version = api_version
```

```python
    input_str = match.group(2)

else:
    print("No match found.")
```


medium

Remove this print statement or replace it with appropriate logging.

```python
log.warning("Input string did not match the expected system/user prompt pattern when input_type is 'messages'.")
```

Comment on lines +418 to +420

```python
if "stream" in api_kwargs and api_kwargs.get("stream", False):
    log.debug("streaming call")
    self.chat_completion_parser = handle_streaming_response
```


medium

Modifying self.chat_completion_parser within the call method introduces a side effect that changes the client's state. It's generally better to pass the appropriate parser to parse_chat_completion or have parse_chat_completion determine the parser based on the type of completion object it receives, rather than relying on instance state modified by call parameters.
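Sketched alternative (callables and names are stand-ins): select the parser from the call's own kwargs and apply it locally, leaving the client instance unmodified:

```python
def call(client, api_kwargs, handle_streaming_response, default_parser):
    """Choose the completion parser per call instead of mutating
    self.chat_completion_parser; no state leaks between calls.
    `client` and the two parser callables are illustrative stand-ins."""
    parser = (handle_streaming_response
              if api_kwargs.get("stream", False)
              else default_parser)
    completion = client(api_kwargs)
    return parser(completion)
```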

```python
obj = super().from_dict(data)
# recreate the existing clients
obj.sync_client = obj.init_sync_client()
obj.async_client = obj.init_async_client()
```


medium

Initialize obj.async_client to None in from_dict to maintain lazy initialization.

```python
obj.async_client = None  # Maintain lazy initialization
```
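A fuller sketch of the lazy-initialization pattern, with stand-in classes (the real `from_dict` would come from the project's serialization base class):

```python
class ClientBase:
    """Minimal stand-in for the serializable client base class."""
    @classmethod
    def from_dict(cls, data: dict):
        obj = cls.__new__(cls)
        obj.__dict__.update(data)
        return obj

class AzureAIClient(ClientBase):
    def init_sync_client(self):
        return object()  # placeholder for AzureOpenAI(...)

    def init_async_client(self):
        return object()  # placeholder for AsyncAzureOpenAI(...)

    @classmethod
    def from_dict(cls, data: dict):
        obj = super().from_dict(data)
        obj.sync_client = obj.init_sync_client()
        obj.async_client = None  # keep lazy: built on first async call
        return obj
```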
