
Support GenerativeModel.count_tokens[_async] for fully-featured generation requests #3631

Open
adubovik opened this issue Apr 17, 2024 · 1 comment

It should be possible to count the number of prompt tokens via the GenerativeModel.count_tokens[_async] methods before running an actual generation request.

However, the method takes into account only the contents component of a request.

Other components of a request (such as tools, tool_config, and system_instruction) presumably affect the prompt token count as well, so it would be great to support them in the count_tokens methods too.
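
A minimal sketch of the discrepancy using the vertexai SDK (the project ID, model name, and tool below are illustrative placeholders; exact token counts will vary):

```python
# Illustrative sketch -- project ID, tool, and model name are placeholders.
import vertexai
from vertexai.generative_models import FunctionDeclaration, GenerativeModel, Tool

vertexai.init(project="my-project", location="us-central1")

get_weather = FunctionDeclaration(
    name="get_weather",
    description="Look up the current weather for a city.",
    parameters={"type": "object", "properties": {"city": {"type": "string"}}},
)

model = GenerativeModel(
    "gemini-1.0-pro",
    system_instruction="You are a terse weather bot.",
    tools=[Tool(function_declarations=[get_weather])],
)

prompt = "What's the weather in Paris?"

# The real request is tokenized with contents + tools + system_instruction...
response = model.generate_content(prompt)
print(response.usage_metadata.prompt_token_count)

# ...but count_tokens only sends contents, so its estimate can come out lower.
print(model.count_tokens(prompt).total_tokens)
```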

product-auto-label bot added the api: vertex-ai label (Issues related to the googleapis/python-aiplatform API) on Apr 17, 2024
Ark-kun (Contributor) commented May 8, 2024

Thank you for the feedback. This makes sense.
However, this is currently not supported by the service API:

```python
class CountTokensRequest(proto.Message):
    ...  # (truncated in the original comment; see the sketch below)
```
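
For context, an abridged, hedged sketch of that request message as published in google.cloud.aiplatform_v1 around that time; the field numbers follow the published proto but should be treated as illustrative, and the comments are editorial:

```python
# Abridged sketch of google.cloud.aiplatform_v1.types.CountTokensRequest
# circa early 2024; check the generated types for the authoritative definition.
from typing import MutableSequence

import proto
from google.protobuf import struct_pb2

from google.cloud.aiplatform_v1.types import content


class CountTokensRequest(proto.Message):
    endpoint: str = proto.Field(proto.STRING, number=1)
    instances: MutableSequence[struct_pb2.Value] = proto.RepeatedField(
        proto.MESSAGE, number=2, message=struct_pb2.Value,
    )
    model: str = proto.Field(proto.STRING, number=3)
    contents: MutableSequence[content.Content] = proto.RepeatedField(
        proto.MESSAGE, number=4, message=content.Content,
    )
    # No fields for tools, tool_config, or system_instruction -- which is why
    # count_tokens cannot account for them until the service API adds them.
```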

Ark-kun self-assigned this May 8, 2024