Support for external models is in beta. Please try it out and let me know about any bugs/issues.
If you open your user settings and click the button to show advanced options, you'll see a text box that allows you to declare custom models (one per line). You can put any OpenAI API-compatible model there.
Basaran is a good option for running any Hugging Face model in a way that's compatible with the OpenAI-style API. Here are the steps:
Run this command in your terminal:
```
docker run --rm -p 80:80 -e MODEL=databricks/dolly-v2-3b hyperonym/basaran:0.19.0
```
You can change `databricks/dolly-v2-3b` to any* Hugging Face language model (see this list), and you can change `0.19.0` to the latest version from here (note: 0.21.1 has a bug).

*Currently, the model must use the newer "fast tokenizer" format - it should have a `tokenizer.json` file in its Hugging Face repo.
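Once the container is up, you can sanity-check the server with a quick request from your browser console. This is just a minimal sketch assuming Basaran's defaults from the command above:

```js
// Smoke test for the local Basaran server started above.
let response = await fetch("http://127.0.0.1/v1/completions", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    model: "databricks/dolly-v2-3b",
    prompt: "Hello, my name is",
    max_tokens: 16,
  }),
});
console.log((await response.json()).choices[0].text);
```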
Add this line to that text box in your user settings:
{name:"databricks/dolly-v2-3b", endpointUrl:"http://127.0.0.1/v1/completions"}
The model selector at the top of the thread should show your new model.
Here's what each config field does:

- `name` - passed to the API as the `model` parameter
- `endpointUrl` - the URL of the inference server's completion endpoint
- `modelUrl` (optional) - if omitted, the URL is assumed to be `https://huggingface.co/${name}` - this is used for the tokenizer & config
- `tokenPricing` (optional) - an object like `{prompt:0.002, completion:0.002}` with prices per 1k tokens
- `maxSequenceLength` (optional) - this will be fetched from Hugging Face automatically, but you should include it if you're using a non-Hugging Face model or if the value can't be found automatically
- `apiKey` (optional) - just in case you're using a service that requires an API key
- `type` (optional) - if not provided, this will be automatically inferred from the URL (`v1/completions` vs `v1/chat/completions`)
  - use `chat-completion` if the API accepts `messages` as an input
  - use `completion` if the API accepts `prompt` as an input
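For reference, a config line that uses several of the optional fields might look like this (all the values here are made up for illustration):

```
{name:"my-org/my-model", endpointUrl:"https://api.example.com/v1/chat/completions", type:"chat-completion", maxSequenceLength:4096, tokenPricing:{prompt:0.002, completion:0.002}, apiKey:"YOUR_API_KEY_HERE"}
```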
Some projects to keep an eye on:
- https://github.com/go-skynet/LocalAI
- https://github.com/closedai-project/closedai
- https://github.com/r2d4/openlm
Also, if you know of any API wrappers for Anthropic, Cohere, etc. models that make them OpenAI API-compatible, please let me know here on GitHub or on the Discord, and I can make sure the custom model system works with them. It should work with any API endpoint that's compatible with OpenAI's completion APIs.
Note that if you do end up finding an API wrapper, you'll still want to use the correct Hugging Face repo name to load the tokenizer - e.g. `Cohere/Command-nightly` contains the `tokenizer.json` needed for the Cohere APIs.
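For example, if you had a hypothetical OpenAI-compatible wrapper for Cohere running at `http://127.0.0.1:8000` (a made-up URL), the config line might look like this - `modelUrl` points at the Hugging Face repo that hosts the tokenizer:

```
{name:"command-nightly", endpointUrl:"http://127.0.0.1:8000/v1/completions", modelUrl:"https://huggingface.co/Cohere/Command-nightly"}
```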
There are some services that duplicate OpenAI's APIs exactly (including the actual model names) because they're just proxying it, or because they have licensed access to OpenAI's models (e.g. Azure). The custom model config doesn't fully support this use case yet, so if you run into trouble with the custom model configs, you can instead run the following code after page load using the "after page load" custom code box in your user settings:
```js
let proxyHandler = {
  apply: function (target, thisArg, argumentsList) {
    let [url, opts, ...rest] = argumentsList;
    // Only rewrite plain-string URLs - leave Request objects and other URLs untouched.
    if (typeof url === "string" && url.startsWith("https://api.openai.com")) {
      url = url.replace("https://api.openai.com", "https://api.example.com");
      opts = opts || {};
      // `headers` can be a Headers instance or a plain object.
      if (opts.headers instanceof Headers) {
        opts.headers.set("authorization", "Bearer YOUR_API_KEY_HERE");
      } else {
        opts.headers = { ...opts.headers, authorization: "Bearer YOUR_API_KEY_HERE" };
      }
    }
    return target.call(thisArg, url, opts, ...rest);
  },
};
let originalFetch = window.fetch;
window.fetch = new Proxy(originalFetch, proxyHandler);
```
You just need to replace `api.example.com` with your proxy's URL, and `YOUR_API_KEY_HERE` with your actual API key.
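If your "proxy" is actually an Azure OpenAI deployment, note that (as far as I know) Azure authenticates with an `api-key` header rather than an `authorization` header, and the model is chosen by the deployment name in the URL path. So the rewrite inside the handler would look more like this sketch - the resource name, deployment name, and api-version are placeholders you'd need to fill in:

```js
// Hypothetical Azure-style rewrite (placeholders: YOUR_RESOURCE, YOUR_DEPLOYMENT, api-version).
if (typeof url === "string" && url.startsWith("https://api.openai.com/v1/chat/completions")) {
  url = "https://YOUR_RESOURCE.openai.azure.com/openai/deployments/YOUR_DEPLOYMENT/chat/completions?api-version=2023-05-15";
  opts = opts || {};
  let headers = new Headers(opts.headers || {}); // works for plain objects and Headers instances
  headers.set("api-key", "YOUR_API_KEY_HERE");
  headers.delete("authorization"); // don't send your OpenAI key to the Azure endpoint
  opts.headers = headers;
}
```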