
fix t5 tokenizer and prompt token failures #1966

Merged
merged 1 commit on May 28, 2024

Conversation

rohithkrn
Contributor

  1. In lmi-dist, the T5 model implementation differs from other models: the preprocessor's tokenizer attribute holds the tokenizer directly instead of a TokenizerGroup.
  2. Skip sending prompt token details for T5, since T5 does not populate them in the request output. The current code errors out because request_output.prompt_token_ids is a single integer (the BOS token id), so enumerating it in the for loop fails. A hedged sketch of both fixes follows this list.
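
A minimal sketch of the two fixes described above, assuming a `model_type` value read from the Hugging Face model config and a `TokenizerGroup` wrapper for non-T5 models; the actual lmi-dist attribute and function names may differ.

```python
def get_tokenizer(preprocessor, model_type):
    """Return the underlying tokenizer for either code path (sketch)."""
    if model_type == "t5":
        # For T5, preprocessor.tokenizer is assumed to already be the tokenizer.
        return preprocessor.tokenizer
    # For other models it is assumed to be a TokenizerGroup wrapping the tokenizer.
    return preprocessor.tokenizer.tokenizer


def should_send_prompt_token_details(request_output, model_type):
    """Decide whether prompt token details can be emitted (sketch)."""
    if model_type == "t5":
        # T5 does not populate prompt token details; prompt_token_ids is a single
        # BOS token id (an int), so enumerating it would raise a TypeError.
        return False
    return bool(request_output.prompt_logprobs)
```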

@rohithkrn requested review from zachgk, frankfliu and a team as code owners on May 24, 2024 at 00:54
@@ -47,6 +47,8 @@ def __init__(self, model_id_or_path: str, properties: dict, **kwargs):
:param properties (dict): other properties of the model, such as decoder strategy
"""
self.lmi_dist_config = LmiDistRbProperties(**properties)
self.model_type = getattr(kwargs.get("model_config", None),
"model_type", None)

Contributor

QQ: I see we are populating model_config; where do we populate "model_type"?

Contributor Author


model_config here is the Hugging Face config.json, which has model_type populated.
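
A quick illustration, assuming the transformers library is available; model_type is a standard field of the Hugging Face config (the model id below is just an example):

```python
from transformers import AutoConfig

# model_type comes straight from the model's config.json
config = AutoConfig.from_pretrained("google/flan-t5-base")
print(config.model_type)  # -> "t5"
```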

Contributor


Oh, I misread it; model_type is read from model_config. Cool.

@@ -75,6 +75,9 @@ def update_request_cache_with_output(request_cache: OrderedDict,
if "prompt_tokens_details" not in request_cache[
request_id] and request_output.prompt_logprobs:

Contributor


Curious: request_output should not have prompt_logprobs either, right?

Contributor Author


From what I can read, request_output.prompt_logprobs is set to [float("nan")], so it seems to pass that check.
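
For context, a non-empty list is truthy even when its only element is NaN, which is why the `and request_output.prompt_logprobs` condition above does not filter this case out:

```python
prompt_logprobs = [float("nan")]
print(bool(prompt_logprobs))  # True -- the check passes despite the NaN value
```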

@rohithkrn merged commit 714fabf into deepjavalibrary:master on May 28, 2024
8 checks passed