Overview

This is a minimal Express.js server with a simple-to-use web client that works with Azure OpenAI and OpenAI endpoints.

The current version supports the /v1/responses API and the classic /v1/chat/completions API, as well as streaming audio and DeepSeek models.

The server can be started quickly in VS Code or in a local Docker container. I host the server on Azure Container Apps.

This server supports:

Key and keyless Entra ID authentication for Azure OpenAI.
The use of streaming outputs in OpenAI and Azure OpenAI, cancelling the streaming request with the Stop button.
Options for formatting AI outputs as code blocks and markdown content.
Working with chat history context. Samples for code generation and text prompts.
Error handling and fallback logic to plain request handling when the streaming option is not supported by the selected language model.
Handling of OpenAI audio streaming.
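
The fallback from streaming to plain request handling can be sketched roughly as follows. This is a minimal illustration rather than the server's actual code: `sendStreaming` and `sendPlain` are hypothetical callbacks standing in for the two request paths, and the check on the error message is an assumption about how an unsupported streaming option is reported.

```javascript
// Hypothetical helper: try the streaming path first and fall back to a
// plain (non-streaming) request when the model rejects the stream option.
async function requestWithFallback(sendStreaming, sendPlain) {
  try {
    return { streamed: true, result: await sendStreaming() };
  } catch (error) {
    // Assumption: an unsupported streaming option surfaces as an error whose
    // message mentions "stream". Any other error is rethrown unchanged.
    const message = String(error && error.message ? error.message : error);
    if (!/stream/i.test(message)) throw error;
    return { streamed: false, result: await sendPlain() };
  }
}
```

Rethrowing unrelated errors keeps authentication or quota failures visible instead of silently retrying them on the plain path.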

Technical stack:

Node.js, Express server, OpenAI module, @azure/identity for API key and keyless Entra ID authentication, plain JavaScript, and index.html.
highlight.js, marked.js

Updates and bug fixes:

March 22, 2025, v1.1.0

Revised the logic
Added support for the API of /v1/responses
Added two models o1-pro and computer-use-preview
- These two models could only use /v1/responses. They did not support classic /v1/chat/completions
- o1-pro worked slowly, and it is the most expensive of the listed models.
Changed handling for the models gpt-4.5-preview and gpt-4o (full) to use the new API of /v1/responses
- /v1/responses can be enabled for other models by adding them to the array public/browser-page.js > modelsThatSupportResponses
- Other models that supported /v1/responses:
  - gpt-4o-mini
  - gpt-4-32k-0314
  - gpt-3.5-turbo
  - o1
  - o3-mini
- Models that did not support /v1/responses:
  - o1-mini
  - o1-preview

March 17, 2025, v1.0.12

Added gpt-4o-mini-search-preview and gpt-4o-search-preview with out-of-the-box internet search

March 9, 2025, v1.0.11

Added gpt-4o-audio-preview and gpt-4o-mini-audio-preview with streaming voice output

March 7, 2025, v1.0.10

Added three embedding models
- text-embedding-3-large, 3072 dimensions
- text-embedding-3-small, 1536 dimensions
- text-embedding-ada-002, default dimensions (1536)
Updated .env.example to the newer AZURE_OPENAI_API_VERSION=2024-12-01-preview
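
As a rough illustration of how these embedding models differ, the sketch below builds a request body for the /v1/embeddings endpoint. `buildEmbeddingRequest` and `embeddingDimensions` are hypothetical names, not part of this repository. The optional `dimensions` parameter is only added for the text-embedding-3 models, since text-embedding-ada-002 always returns its default size.

```javascript
// Default output sizes as listed in the changelog entry above.
const embeddingDimensions = {
  "text-embedding-3-large": 3072,
  "text-embedding-3-small": 1536,
  "text-embedding-ada-002": 1536,
};

// Hypothetical helper that builds a request body for POST /v1/embeddings.
function buildEmbeddingRequest(model, input, dimensions) {
  if (!Object.hasOwn(embeddingDimensions, model)) {
    throw new Error(`Unknown embedding model: ${model}`);
  }
  const body = { model, input };
  // Only the text-embedding-3 models accept the optional "dimensions" field.
  if (dimensions !== undefined && model.startsWith("text-embedding-3")) {
    body.dimensions = dimensions;
  }
  return body;
}
```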

March 4, 2025, v1.0.9

Added optional support for the deepseek-chat and deepseek-reasoner models, which correspond to DeepSeek V3 and DeepSeek R1 respectively.
To enable them, sign up at https://platform.deepseek.com/, create an API key, and deposit at least $2.
Update DEEPSEEK_API_KEY with your API key. Now, you can start using DeepSeek models within this web app.
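
Because DeepSeek exposes an OpenAI-compatible API, enabling it mostly comes down to a different base URL and key. The sketch below shows one way this could look; `deepSeekClientOptions` is a hypothetical helper, not a function from this repository.

```javascript
// Hypothetical helper: returns client options for DeepSeek, or null when
// DEEPSEEK_API_KEY is not configured.
function deepSeekClientOptions(env = process.env) {
  const apiKey = (env.DEEPSEEK_API_KEY || "").trim();
  if (!apiKey) return null;
  // DeepSeek serves its OpenAI-compatible API at this base URL.
  return { baseURL: "https://api.deepseek.com", apiKey };
}
```

The returned object has the shape accepted by the `OpenAI` constructor of the openai module, so it could be passed as `new OpenAI(deepSeekClientOptions())` once the key is set.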

March 1, 2025, v1.0.8

Added gpt-4.5-preview.

February 15, 2025, v1.0.6

The streaming option appeared in the full-scale o1 models.
Added gradient header text, Thinking..., which is visible while the model is not streaming or does not support streaming output.
Added the ability to freeze automatic scrolling if the user's scroll position is not at the bottom of the text.
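
The scroll-freezing behaviour can be sketched as below. This is an assumed implementation for illustration only: `isAtBottom`, `appendChunk`, and the 4-pixel tolerance are hypothetical and not taken from the repository.

```javascript
// Returns true when the view is at (or within `tolerance` pixels of) the bottom.
function isAtBottom({ scrollTop, clientHeight, scrollHeight }, tolerance = 4) {
  return scrollHeight - (scrollTop + clientHeight) <= tolerance;
}

// Hypothetical usage in the browser: check the position before appending a
// streamed chunk, then scroll down only if the user had not scrolled up.
function appendChunk(element, text) {
  const follow = isAtBottom(element);
  element.textContent += text;
  if (follow) element.scrollTop = element.scrollHeight;
}
```

Checking the position before the text is appended matters, because the new content increases `scrollHeight` and would otherwise make the view look scrolled up.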

February 9, 2025, v1.0.5

Added user interface support for mobile devices.
Added a Dockerfile for deployment to containers.

February 8, 2025, v1.0.4

The default configuration does not require Azure OpenAI. Use your regular OpenAI endpoints and explicitly configure specific ones to be handled by Azure OpenAI. Colleagues commented that they did not have access to Azure OpenAI outside the Microsoft environment.
Added o3-mini, o1, o1-2024-12-17, gpt-3.5-turbo, and gpt-4-32k-0314 to available default selections. You can add more models to index.html.
- As of Feb 8, 2025, the model o3-mini was available to OpenAI users at Tier 3 or higher.
- The full models o1 and o1-2024-12-17 did not have the streaming option in API. I added fallback to the regular handling to support these models.
Added sample code and text prompts, rotated by the new Code and Text buttons respectively; added Clear and Reset buttons.
Improved error handling.
Added handling for the Enter and Space keys:
- If the user presses Enter in the prompt box, this initiates the request. The user can interrupt the process by pressing Enter or Space.
- The user can enter multiple lines into the prompt box by holding the Shift, Alt, or Ctrl key and pressing Enter. This does not initiate the request.
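
The key handling described above could be expressed as a small classifier like the one below. It is a sketch under assumptions: `classifyKey`, its return values, and the `isBusy` flag are hypothetical names chosen for illustration.

```javascript
// Hypothetical classifier for a keydown event in the prompt box.
// `isBusy` is true while a request is in progress.
function classifyKey(event, isBusy) {
  const isEnter = event.key === "Enter";
  const isSpace = event.key === " ";
  // While a request is running, Enter or Space interrupts it.
  if (isBusy) return isEnter || isSpace ? "stop" : "ignore";
  if (!isEnter) return "ignore";
  // Shift, Alt, or Ctrl together with Enter adds a line break instead.
  return event.shiftKey || event.altKey || event.ctrlKey ? "newline" : "send";
}
```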
Added the option to disable streaming for all models.
- Streaming is ON by default for all models. Models that do not support the streaming option automatically fall back to regular request processing.
- Uncomment the line NO_STREAMING=true in your .env file to disable streaming. Please refer to .env.example for details.
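
A server could read this switch roughly as follows. How the repository actually parses the variable is not shown here, so `isStreamingEnabled` and its case-insensitive comparison are assumptions.

```javascript
// Assumption: streaming stays ON unless NO_STREAMING is set to "true".
function isStreamingEnabled(env = process.env) {
  return String(env.NO_STREAMING || "").trim().toLowerCase() !== "true";
}
```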

Bugs fixed:

Duplicates in the conversation history that appeared after the second request.
The experimental node --watch switch occasionally caused infinite loops, for instance when I ran npm run dev on the first load.
- I replaced node --watch with good old nodemon. Now npm run dev can be used for dynamic reloads on file updates.
Azure OpenAI used default model deployment for different model selections. I moved the logic to the route handler.

Getting started

Sign up for the OpenAI API at https://platform.openai.com/

Tier 0: The free trial provides limited use of the model gpt-4o-mini.
Tier 1: This tier allows for comfortable usage of the models gpt-4o-mini, gpt-4o, o1-mini, o1-preview, gpt-3.5-turbo, and gpt-4-32k-0314.
- To obtain this tier, OpenAI requires you to deposit $5.
Tier 3: This tier provides access to the full-scale o1 models and the newest o3-mini as of February 2025.
- To obtain this tier, OpenAI requires you to deposit $100.
You can review all available tiers at https://platform.openai.com/docs/guides/rate-limits?tier=tier-one#usage-tiers

Clone the project repository and open it in your preferred editor, such as Visual Studio Code.

Create your own .env file using .env.example as a template, and adjust your values as needed.

AZURE_OPENAI_ENDPOINT=https://<your-azure-openai-instance>.openai.azure.com
# Comment out the next line if you are going to use Azure OpenAI keyless authentication. Start your express server using node server-entraid.js instead of node server.js.
AZURE_OPENAI_API_KEY=<your apiKey for Azure OpenAI> # Unless you use keyless
...
OPENAI_API_KEY=<your apiKey for regular OpenAI> # This key is used by o1-mini and o1-preview models unavailable at Azure OpenAI as of December 1, 2024

npm i

node server.js

OR

npm run dev

http://localhost:3000

Minimal ChatGPT server powered by express.js with streaming outputs

By default, the server supports the following types of requests to OpenAI instances:

OpenAI with Bearer <apiKey>
Azure OpenAI with <api-key>
Azure OpenAI with keyless authentication using the Entra ID provider
DeepSeek with Bearer <apiKey>
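
The four request types differ mainly in the authentication header. The sketch below illustrates that difference; `buildAuthHeaders` is a hypothetical helper, and the target names are borrowed from the targetEndpoints values used in public/browser-page.js.

```javascript
// Hypothetical helper: builds the authentication header for each target.
// `credential` is an API key, or an Entra ID access token in keyless mode.
function buildAuthHeaders(target, credential, keyless = false) {
  switch (target) {
    case "openai":
    case "deepseek":
      return { Authorization: `Bearer ${credential}` };
    case "azureopenai":
      // Azure OpenAI takes the key in the "api-key" header; an Entra ID
      // token is sent as a Bearer token instead.
      return keyless
        ? { Authorization: `Bearer ${credential}` }
        : { "api-key": credential };
    default:
      throw new Error(`Unknown target: ${target}`);
  }
}
```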

To test Azure OpenAI chats, establish desired deployments on your Azure OpenAI resource. For instance:

gpt-4o-mini
gpt-4o
- Values of the models should match the established Azure OpenAI deployments.
- These values can be adjusted for each Azure OpenAI enabled model in public/index.html.

Additionally, there should be two or more active models available in your regular prepaid OpenAI API service.

As of December 1, 2024, o1-models are not yet publicly available in Azure OpenAI. The server utilizes these models through the regular OpenAI API subscription.

o1-preview
o1-mini
- Update: this model appeared in Azure OpenAI as of Jan, 2025. Its streaming option was enabled in March 2025.

You can add the desired models to the file public/index.html.

<select class="model">
  <option value="o1-pro">o1-pro (warning: the most expensive model)</option>
  <option value="computer-use-preview">computer-use-preview</option>
  <option value="gpt-4o-mini-search-preview">gpt-4o-mini-search-preview</option>
  <option value="gpt-4o-search-preview">gpt-4o-search-preview</option>
  <option value="gpt-4.5-preview">gpt-4.5-preview</option>
  <option value="gpt-4o-audio-preview">gpt-4o-audio-preview (warning: loud voice out)</option>
  <option value="gpt-4o-mini-audio-preview">gpt-4o-mini-audio-preview (warning: loud voice out)</option>
  <option value="gpt-4o">gpt-4o</option>
  <option value="gpt-4o-mini" selected="true">gpt-4o-mini</option>
  <option value="gpt-3.5-turbo">gpt-3.5-turbo</option>
  <option value="o3-mini">o3-mini (tier 3+ required)</option>
  <option value="o1-mini">o1-mini</option>
  <option value="o1">o1</option>
  <option value="o1-preview">o1-preview</option>
  <option value="gpt-4-32k-0314">gpt-4-32k-0314</option>
  <option value="text-embedding-3-large">text-embedding-3-large</option>
  <option value="text-embedding-3-small">text-embedding-3-small</option>
  <option value="text-embedding-ada-002">text-embedding-ada-002</option>
  <option value="deepseek-chat">deepseek-chat (deepseek-v3)</option>
  <option value="deepseek-reasoner">deepseek-reasoner (deepseek-r1)</option>
  </select>

You can also change target endpoint routings - to azureopenai or openai - at the header of public/browser-page.js (public/browser-console.js for the console client).

By default, requests to language model endpoints are handled by the regular OpenAI. You can reconfigure specific models to be handled by Azure OpenAI and/or the regular OpenAI.

Uncomment corresponding lines and make sure that Azure OpenAI deployments exist as mentioned above.

const targetEndpoints = {
  //"4o": "azureopenai",
  //"o1": "openai",
  //"o1-mini": "azureopenai",
  //"o3-mini": "azureopenai",
  //"embedding": "azureopenai",
  //"gpt-4o-audio-preview": "azureopenai",
  "deepseek": "deepseek",
  "default": "openai",
};
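
How a model name is matched against these keys is not spelled out here. The sketch below shows one plausible lookup, assuming that a key matches when the model name contains it and that the longest matching key wins. `resolveTarget` is a hypothetical function, and the "o1-mini" entry is enabled only for the sake of the example.

```javascript
const targetEndpoints = {
  "o1-mini": "azureopenai", // example: this line has been uncommented
  "deepseek": "deepseek",
  "default": "openai",
};

// Assumption: a key matches when the model name contains it, and the longest
// matching key wins, so "o1-mini" takes precedence over "o1".
function resolveTarget(model, endpoints = targetEndpoints) {
  const match = Object.keys(endpoints)
    .filter((key) => key !== "default" && model.includes(key))
    .sort((a, b) => b.length - a.length)[0];
  return endpoints[match ?? "default"];
}
```

Preferring the longest key avoids an ambiguity that a first-match lookup would have when both "o1" and "o1-mini" are enabled.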

By default, text models use the classic API endpoint /v1/chat/completions. You can configure specific models to use the newer /v1/responses by adding them to the array modelsThatSupportResponses.

const modelsThatSupportResponses = [ // As of March 22, 2025
  "computer-use-preview", // computer-use-preview only supported responses API; it did not support chat API
  "gpt-4.5-preview",      // supports both, responses and chat API
  "gpt-4o",               // supports both, responses and chat API
  //"gpt-4o-mini",        // supports both, responses and chat API
  //"gpt-4-32k-0314",     // supports both, responses and chat API
  //"gpt-3.5-turbo",      // supports both, responses and chat API
  //"o1",                 // o1 was working much slower on responses API
  "o1-pro",               // o1-pro only supported responses API; it did not support chat API
  //"o3-mini",            // o3-mini was working much slower; o1-mini and o1-preview did not support responses API
];
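
Choosing the API path from this array might look like the sketch below. `apiPathFor` is a hypothetical function, and exact-name comparison is an assumption: with it, "gpt-4o-mini" is not matched by the "gpt-4o" entry.

```javascript
const modelsThatSupportResponses = [
  "computer-use-preview",
  "gpt-4.5-preview",
  "gpt-4o",
  "o1-pro",
];

// Assumption: model names are compared exactly, so "gpt-4o-mini" is not
// matched by the "gpt-4o" entry.
function apiPathFor(model, supported = modelsThatSupportResponses) {
  return supported.includes(model) ? "/v1/responses" : "/v1/chat/completions";
}
```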

User interface with streaming output, which consumes data from the server

Open http://localhost:3000 in your browser.

Alternatively, you can open public/index.html from the local folder and click the Send button.

Browser console client with streaming output, which consumes data from the server

Execute public/browser-console.js in the browser's console available by pressing F12 in Chrome.
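
For orientation, consuming a streamed response in the browser generally follows the pattern below. This is a generic sketch and not the content of public/browser-console.js: `consumeStream` is a hypothetical helper, and the route and request body in the commented usage are placeholders.

```javascript
// Reads a ReadableStream of bytes and passes decoded text to `onText` as it
// arrives. Returns the full text once the stream ends.
async function consumeStream(stream, onText) {
  const reader = stream.getReader();
  const decoder = new TextDecoder();
  let full = "";
  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    // { stream: true } keeps multi-byte characters split across chunks intact.
    const text = decoder.decode(value, { stream: true });
    full += text;
    onText(text);
  }
  // Flush any bytes still buffered in the decoder.
  const tail = decoder.decode();
  if (tail) {
    full += tail;
    onText(tail);
  }
  return full;
}

// Hypothetical usage in the browser console; the URL and the request body
// are placeholders, not the server's documented route.
// const controller = new AbortController(); // controller.abort() stops the stream
// const response = await fetch("/your-chat-route", {
//   method: "POST",
//   headers: { "Content-Type": "application/json" },
//   body: JSON.stringify({ /* prompt, model, ... */ }),
//   signal: controller.signal,
// });
// await consumeStream(response.body, (text) => console.log(text));
```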

How to test keyless authentication for Azure OpenAI on this express server

To test keyless (Entra ID) authentication locally, follow these steps:

Install Azure CLI (az) on your machine: https://learn.microsoft.com/en-us/cli/azure/install-azure-cli
Open your Azure OpenAI resource in the Azure portal, navigate to Access Control (IAM), and add the role Cognitive Services OpenAI User to your Entra ID account.
Open your .env file. Comment out or remove the line AZURE_OPENAI_API_KEY=...
Log in to Azure using the following command. Select your subscription for the Azure AI resource if required:
- az login --scope https://cognitiveservices.azure.com/.default
Run the server using the following command: node server-entraid.js. Keyless authentication should start to work.
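
The difference between the two modes can be summarised in a small decision function. Note that this repository selects the mode by which entry file you start (server.js or server-entraid.js), so `resolveAzureAuth` below is a hypothetical illustration of the same decision, not the project's code.

```javascript
// Scope used in the az login command above.
const COGNITIVE_SERVICES_SCOPE = "https://cognitiveservices.azure.com/.default";

// Hypothetical helper: decides the Azure OpenAI auth mode from the environment.
function resolveAzureAuth(env = process.env) {
  const endpoint = (env.AZURE_OPENAI_ENDPOINT || "").trim();
  if (!endpoint) throw new Error("AZURE_OPENAI_ENDPOINT is not set");
  const apiKey = (env.AZURE_OPENAI_API_KEY || "").trim();
  // Without an API key, fall back to keyless (Entra ID) authentication.
  return apiKey
    ? { mode: "key", endpoint, apiKey }
    : { mode: "keyless", endpoint, scope: COGNITIVE_SERVICES_SCOPE };
}
```

In keyless mode, the scope would typically be passed to `getBearerTokenProvider(new DefaultAzureCredential(), scope)` from @azure/identity, and the resulting provider handed to the `AzureOpenAI` client of the openai module as `azureADTokenProvider`.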

Useful links

https://learn.microsoft.com/en-us/azure/ai-services/openai/chatgpt-quickstart?tabs=command-line%2Cjavascript-key%2Ctypescript-keyless%2Cpython-new&pivots=programming-language-javascript

https://techcommunity.microsoft.com/blog/azuredevcommunityblog/using-keyless-authentication-with-azure-openai/4111521


Paul-Borisov/minimal-azure-openai-express-html-with-streaming
