Expose GTO's model version in FastAPI's `interface.json` #665

aguschin · 2023-04-27T02:40:48Z

Follow-up for #664. Sometimes it's desired to store predictions along with the specific model version that returned those predictions. There are at least two ways to support that in MLEM:

Return model version in prediction - then what's returned will be a json like {"prediction": [0.4, 0.6], "version": 0.1.3}. I've seen some generic ML frameworks doing this IIRC.
Return it in interface.json - we already have MLEM version there, so adding model version looks logical

Regarding how we get this info into the service. Again, there are two approaches:

Add it at mlem.api.save
Allow to specify it when building server

First seems more reasonable to me. Since this will require some under-the-hood integration with GTO, I'd do this after #664 - which have the same decision to make.

fyi @omesser

The text was updated successfully, but these errors were encountered:

aguschin · 2023-05-25T13:06:41Z

For the record, I'm going to take 2nd option (Return version in interface.json).

And it looks like I need iterative/gto#335 completed first to support this in MLEM.

aguschin · 2023-05-30T09:31:29Z

UPD: getting the version at mlem.api.save doesn't work since at this moment the commit doesn't exist, not mentioning the right GTO git tag. So it should be read once you build a server.

Bare minimum needed for iterative/mlem#665 ```[tasklist] - [ ] Allow passing `refs`, getting right version - [ ] Add tests ``` Overall, feels like GTO codebase needs a refactoring... While working on this I started with #361, and it's tough to add anything like that without rewriting big chunks of GTO now. There are many things each stepping on other's toes... Plus many implementation decisions were made taking annotations into account, which is no longer a thing in GTO, except for some API for Studio to enable backward compatibility while making updates easier. They should be thrown away to make it easier to contribute new features 🪣

aguschin added serialization Dumping and loading Python objects serve Serving models customer Request from customer labels Apr 27, 2023

aguschin self-assigned this May 24, 2023

aguschin mentioned this issue May 25, 2023

Allow passing ref to api._show_versions iterative/gto#362

Merged

aguschin linked a pull request May 30, 2023 that will close this issue

Expose GTO version in FastAPI's interface #681

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose GTO's model version in FastAPI's `interface.json` #665

Expose GTO's model version in FastAPI's `interface.json` #665

aguschin commented Apr 27, 2023

aguschin commented May 25, 2023 •

edited

Loading

aguschin commented May 30, 2023

Expose GTO's model version in FastAPI's interface.json #665

Expose GTO's model version in FastAPI's interface.json #665

Comments

aguschin commented Apr 27, 2023

aguschin commented May 25, 2023 • edited Loading

aguschin commented May 30, 2023

Expose GTO's model version in FastAPI's `interface.json` #665

Expose GTO's model version in FastAPI's `interface.json` #665

aguschin commented May 25, 2023 •

edited

Loading