OpenAI: /v1/models/{model} compatibility #5028

royjhan · 2024-06-13T18:29:28Z

Adds compatibility for /v1/models/{model}

E.g
curl http://localhost:11434/v1/models/llama3

{
    "id": "llama3",
    "object": "model",
    "created": 1718141294,
    "owned_by": "library"
}

JerrettDavis

Looks good! Way better approach than what I was attempting in my branch.

jmorganca · 2024-06-13T20:07:38Z

openai/openai.go

+func (w *DeleteWriter) writeResponse(data []byte) (int, error) {
+	// delete completion
+	w.ResponseWriter.Header().Set("Content-Type", "application/json")
+	err := json.NewEncoder(w.ResponseWriter).Encode(toDeleteCompletion(w.model))


We most likely don't need toDeleteCompletion and can just provide the struct inline here

openai/openai.go

server/routes.go

jmorganca · 2024-06-15T06:32:12Z

server/routes_test.go

@@ -233,6 +233,24 @@ func Test_Routes(t *testing.T) {
 				assert.Equal(t, expectedParams, params)
 			},
 		},
+		{
+			Name:   "Retrieve Model Handler OpenAI",


This works, however what would be even better is some tests in openai.go that test the middleware in isolation. Here's an example I had an LLM generate me for a middleware that changes formats (not the same although I think you get the idea)

package main import ( "net/http" "net/http/httptest" "testing" "github.com/gin-gonic/gin" ) func TestFormatMiddleware(t *testing.T) { // Create a test router r := gin.New() r.Use(FormatMiddleware()) r.GET("/test", func(c *gin.Context) { format := c.GetString("format") if format == "json" { c.JSON(http.StatusOK, gin.H{"message": "Success"}) } else if format == "xml" { c.XML(http.StatusOK, gin.H{"message": "Success"}) } else { c.String(http.StatusOK, "Success") } }) // Define test cases tests := []struct { name string queryParam string expectedStatus int expectedBody string }{ {"JSON Format", "json", http.StatusOK, `{"message":"Success"}`}, {"XML Format", "xml", http.StatusOK, `<map><message>Success</message></map>`}, {"Default Format", "", http.StatusOK, "Success"}, } for _, tt := range tests { t.Run(tt.name, func(t *testing.T) { req, _ := http.NewRequest("GET", "/test?format="+tt.queryParam, nil) w := httptest.NewRecorder() r.ServeHTTP(w, req) if w.Code != tt.expectedStatus { t.Errorf("Expected status %d, got %d", tt.expectedStatus, w.Code) } if w.Body.String() != tt.expectedBody { t.Errorf("Expected body %s, got %s", tt.expectedBody, w.Body.String()) } }) } }

jmorganca · 2024-06-20T00:11:13Z

openai/openai.go

@@ -175,7 +175,16 @@ func toListCompletion(r api.ListResponse) ListCompletion {
 	}
 }

-func fromRequest(r ChatCompletionRequest) api.ChatRequest {
+func toRetrieveCompletion(r api.ShowResponse, model string) Model {


should we change this to toModel?

Suggested change

func toRetrieveCompletion(r api.ShowResponse, model string) Model {

func toModel(r api.ShowResponse, model string) Model {

jmorganca · 2024-06-23T02:22:12Z

openai/openai.go

+		Id:      model,
+		Object:  "model",
+		Created: r.ModifiedAt.Unix(),
+		OwnedBy: "ollama",


This should be the Ollama username similar to /v1/models

jmorganca · 2024-06-23T02:25:36Z

openai/openai_test.go

@@ -7,6 +7,7 @@ import (
 	"net/http"
 	"net/http/httptest"
 	"testing"
+	"time"

 	"github.com/gin-gonic/gin"


Can we remove Gin from this test file (in the PR this branches off of)

Gin is needed to test the middleware, do you mind explaining what you mean?

jmorganca · 2024-06-23T02:26:19Z

openai/openai_test.go

+					t.Fatal(err)
+				}
+
+				assert.Equal(t, "model", retrieveResp.Object)


Let's not use assert either – just t.Fatal t.Fatalf etc in the standard testing library is best

jmorganca · 2024-06-23T02:26:28Z

openai/openai_test.go

@@ -87,6 +88,27 @@ func TestMiddleware(t *testing.T) {
 				assert.Equal(t, "Test Model", listResp.Data[0].Id)
 			},
 		},
+		{
+			Name:       "OpenAI Retrieve Handler",


retrieve model

jmorganca · 2024-06-23T02:27:28Z

server/routes.go

@@ -660,6 +660,28 @@ func (s *Server) ShowModelHandler(c *gin.Context) {
 	c.JSON(http.StatusOK, resp)
 }

+func (s *Server) RetrieveModelHandler(c *gin.Context) {


Is this used? I believe it's implemented in middleware now?

* OpenAI v1 models * Refactor Writers * Add Test Co-Authored-By: Attila Kerekes * Credit Co-Author Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com> * Empty List Testing * Use Namespace for Ownedby * Update Test * Add back envconfig * v1/models docs * Use ModelName Parser * Test Names * Remove Docs * Clean Up * Test name Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Add Middleware for Chat and List * Testing Cleanup * Test with Fatal * Add functionality to chat test * OpenAI: /v1/models/{model} compatibility (#5028) * Retrieve Model * OpenAI Delete Model * Retrieve Middleware * Remove Delete from Branch * Update Test * Middleware Test File * Function name * Cleanup * Test Update * Test Update --------- Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

Retrieve Model

eccd191

JerrettDavis approved these changes Jun 13, 2024

View reviewed changes

royjhan changed the title ~~OpenAI: /v1/models/{model} compatability~~ OpenAI: /v1/models/{model} compatibility Jun 13, 2024

OpenAI Delete Model

c7ec861

royjhan changed the title ~~OpenAI: /v1/models/{model} compatibility~~ OpenAI: /v1/models/{model} Retrieve and Delete compatibility Jun 13, 2024

jmorganca reviewed Jun 13, 2024

View reviewed changes

openai/openai.go Outdated Show resolved Hide resolved

jmorganca reviewed Jun 13, 2024

View reviewed changes

server/routes.go Outdated Show resolved Hide resolved

royjhan added 2 commits June 13, 2024 15:35

Retrieve Middleware

81791a0

Remove Delete from Branch

8fd3e47

royjhan changed the title ~~OpenAI: /v1/models/{model} Retrieve and Delete compatibility~~ OpenAI: /v1/models/{model} Retrieve compatibility Jun 14, 2024

jmorganca reviewed Jun 15, 2024

View reviewed changes

royjhan added 3 commits June 15, 2024 22:02

Merge branch 'royh-openai' into royh-retrieve

7ccbfa4

Update Test

48135e7

Middleware Test File

ab8db6e

jmorganca reviewed Jun 20, 2024

View reviewed changes

Function name

1c6813d

royjhan changed the title ~~OpenAI: /v1/models/{model} Retrieve compatibility~~ OpenAI: /v1/models/{model} compatibility and v1/completions compatibility Jun 21, 2024

royjhan changed the title ~~OpenAI: /v1/models/{model} compatibility and v1/completions compatibility~~ OpenAI: /v1/models/{model} compatibility Jun 22, 2024

royjhan force-pushed the royh-retrieve branch from c71394c to 1c6813d Compare June 22, 2024 00:48

Merge branch 'royh-openai' into royh-retrieve

1ef5455

jmorganca reviewed Jun 23, 2024

View reviewed changes

royjhan and others added 4 commits June 24, 2024 09:37

Merge branch 'royh-openai' into royh-retrieve

9f28756

Cleanup

f3f842a

Test Update

1865360

Test Update

c716e55

jmorganca approved these changes Jun 28, 2024

View reviewed changes

royjhan merged commit 9bd9d39 into royh-openai Jul 2, 2024
12 checks passed

royjhan deleted the royh-retrieve branch July 2, 2024 18:40

royjhan mentioned this pull request Jul 2, 2024

OpenAI: /v1/models and /v1/models/{model} compatibility #5007

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenAI: /v1/models/{model} compatibility #5028

OpenAI: /v1/models/{model} compatibility #5028

royjhan commented Jun 13, 2024 •

edited

Loading

JerrettDavis left a comment

jmorganca Jun 13, 2024

jmorganca Jun 15, 2024 •

edited

Loading

jmorganca Jun 20, 2024

jmorganca Jun 23, 2024 •

edited

Loading

jmorganca Jun 23, 2024

royjhan Jun 24, 2024

jmorganca Jun 23, 2024

jmorganca Jun 23, 2024

jmorganca Jun 23, 2024

	func toRetrieveCompletion(r api.ShowResponse, model string) Model {
	func toModel(r api.ShowResponse, model string) Model {

OpenAI: /v1/models/{model} compatibility #5028

OpenAI: /v1/models/{model} compatibility #5028

Conversation

royjhan commented Jun 13, 2024 • edited Loading

JerrettDavis left a comment

Choose a reason for hiding this comment

jmorganca Jun 13, 2024

Choose a reason for hiding this comment

jmorganca Jun 15, 2024 • edited Loading

Choose a reason for hiding this comment

jmorganca Jun 20, 2024

Choose a reason for hiding this comment

jmorganca Jun 23, 2024 • edited Loading

Choose a reason for hiding this comment

jmorganca Jun 23, 2024

Choose a reason for hiding this comment

royjhan Jun 24, 2024

Choose a reason for hiding this comment

jmorganca Jun 23, 2024

Choose a reason for hiding this comment

jmorganca Jun 23, 2024

Choose a reason for hiding this comment

jmorganca Jun 23, 2024

Choose a reason for hiding this comment

royjhan commented Jun 13, 2024 •

edited

Loading

jmorganca Jun 15, 2024 •

edited

Loading

jmorganca Jun 23, 2024 •

edited

Loading