
Conversation

@jhamon (Collaborator) commented May 15, 2025

Problem

We need to expose a new endpoint for discovering the available inference models.

Solution

  • Regenerate code off the latest spec
  • Wire the new method up in the sync and async implementations of Inference
    • pc.inference.get_model
    • pc.inference.list_models
  • Adjust model_utils to be less fragile when unexpected values appear in enum fields
  • Implement new tests for these endpoints.
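The model_utils change itself is not shown in this description. As a hedged illustration of the "less fragile enum" idea, a lookup can fall back to the raw string when the server returns a value this client version doesn't know about (the names below are hypothetical, not the actual model_utils code):

```python
from enum import Enum

class VectorType(Enum):
    DENSE = "dense"
    SPARSE = "sparse"

def parse_enum(enum_cls, raw):
    """Return the matching enum member, or the raw value unchanged
    when the server sends something this client doesn't recognize."""
    try:
        return enum_cls(raw)
    except ValueError:
        return raw

parse_enum(VectorType, "dense")   # -> VectorType.DENSE
parse_enum(VectorType, "hybrid")  # -> "hybrid" (no crash on unknown values)
```

This keeps an older SDK working when the API starts returning enum values that didn't exist when the client was generated.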

Usage

from pinecone import Pinecone

pc = Pinecone()

models = pc.inference.list_models()
models[0]
# {
#     "model": "llama-text-embed-v2",
#     "short_description": "A high performance dense embedding model optimized for multilingual and cross-lingual text question-answering retrieval with support for long documents (up to 2048 tokens) and dynamic embedding size (Matryoshka Embeddings).",
#     "type": "embed",
#     "supported_parameters": [
#         {
#             "parameter": "input_type",
#             "type": "one_of",
#             "value_type": "string",
#             "required": true,
#             "allowed_values": [
#                 "query",
#                 "passage"
#             ]
#         },
#         {
#             "parameter": "truncate",
#             "type": "one_of",
#             "value_type": "string",
#             "required": false,
#             "default": "END",
#             "allowed_values": [
#                 "END",
#                 "NONE",
#                 "START"
#             ]
#         },
#         {
#             "parameter": "dimension",
#             "type": "one_of",
#             "value_type": "integer",
#             "required": false,
#             "default": 1024,
#             "allowed_values": [
#                 384,
#                 512,
#                 768,
#                 1024,
#                 2048
#             ]
#         }
#     ],
#     "vector_type": "dense",
#     "default_dimension": 1024,
#     "modality": "text",
#     "max_sequence_length": 2048,
#     "max_batch_size": 96,
#     "provider_name": "NVIDIA",
#     "supported_metrics": [
#         "Cosine",
#         "DotProduct"
#     ],
#     "supported_dimensions": [
#         384,
#         512,
#         768,
#         1024,
#         2048
#     ]
# }
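The supported_parameters metadata above is enough to drive client-side validation before sending a request. A minimal sketch over plain dicts shaped like the output shown (the helper name is made up for illustration):

```python
def check_param(supported_parameters, name, value):
    """Validate a value against a model's supported_parameters entry."""
    for p in supported_parameters:
        if p["parameter"] == name:
            if p["type"] == "one_of" and value not in p["allowed_values"]:
                raise ValueError(f"{name} must be one of {p['allowed_values']}")
            return True
    raise KeyError(f"unknown parameter: {name}")

# Same shape as the "dimension" entry printed above
supported = [
    {"parameter": "dimension", "type": "one_of", "value_type": "integer",
     "required": False, "default": 1024,
     "allowed_values": [384, 512, 768, 1024, 2048]},
]

check_param(supported, "dimension", 512)  # -> True
```

Passing a dimension outside allowed_values raises a ValueError locally instead of a round-trip to the API.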

And the async equivalent:

import asyncio
from pinecone import PineconeAsyncio

async def main():
    async with PineconeAsyncio() as pc:
        models = await pc.inference.list_models()

asyncio.run(main())
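list_models() returns the full catalogue, so narrowing to a model type can happen client-side once the result is fetched. A sketch over plain dicts shaped like the output above (the sample list stands in for a real API response):

```python
def embed_models(models):
    """Keep only embedding models from a list_models-style result."""
    return [m for m in models if m.get("type") == "embed"]

sample = [
    {"model": "llama-text-embed-v2", "type": "embed"},
    {"model": "example-rerank-model", "type": "rerank"},
]

[m["model"] for m in embed_models(sample)]  # -> ["llama-text-embed-v2"]
```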

Type of Change

  • New feature (non-breaking change which adds functionality)

@jhamon changed the base branch from main to release-candidate/2025-04 May 15, 2025 16:37
@jhamon force-pushed the jhamon/list-models branch from 7087d07 to 27c4f08 May 15, 2025 18:15
@jhamon changed the title from Expose new pc.inference.list_models() to Expose new pc.inference.list_models() and pc.inference.get_model() May 15, 2025
@jhamon force-pushed the jhamon/list-models branch from 86c5f66 to c4c78b2 May 16, 2025 14:02
@jhamon marked this pull request as ready for review May 16, 2025 15:24
@jhamon merged commit c1688f6 into release-candidate/2025-04 May 16, 2025
71 of 72 checks passed
@jhamon deleted the jhamon/list-models branch May 16, 2025 15:24