[DEVX-828] Added image summarization in multimodal pipeline #31

mansi-k · 2024-11-19T15:51:06Z

https://clarifai.atlassian.net/browse/DEVX-828

abhash-ai · 2024-11-20T14:17:52Z

clarifai_datautils/multimodal/pipeline/summarizer.py

+    resp = self.model.predict(img_inputs)
+
+    new_elements = []
+    for i, element in enumerate(resp.outputs):
+      summary = ""
+      if image_elements[i].text:


Are we sure that the index of an image in image_elements is the same as its index in resp.outputs?
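
A minimal sketch of how the pairing could be made explicit instead of relying on a bare index. It assumes the model returns one output per input in request order, which this thread does not confirm; `pair_outputs` and the plain lists are hypothetical stand-ins, not the SDK's types.

```python
def pair_outputs(image_elements, outputs):
    """Pair each input element with its output, failing loudly on a count mismatch."""
    # Assumption: outputs come back in the same order the inputs were sent.
    if len(image_elements) != len(outputs):
        raise ValueError(
            f"expected {len(image_elements)} outputs, got {len(outputs)}")
    return list(zip(image_elements, outputs))
```

A length check like this only catches dropped or extra outputs; it would not detect reordering, which is where an ID-based lookup helps.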

abhash-ai · 2024-11-20T14:20:15Z

clarifai_datautils/multimodal/pipeline/loaders.py

    image_data = meta.pop('image_base64', None)
+    id = meta.get('input_id', None)


QQ: why are we adding this new ID field, and will it be used?

Yes, this is to identify the corresponding summary that was generated for the images in the PDF!
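
To illustrate the idea, here is a rough sketch of matching summaries back to images by ID rather than by position. The `Summary` class and both helper functions are hypothetical, and whether each model output carries the originating input ID is an assumption here, not something shown in the diff.

```python
from dataclasses import dataclass


@dataclass
class Summary:
    """Hypothetical stand-in for one model output."""
    input_id: str
    text: str


def summaries_by_id(summaries):
    """Build a lookup from input ID to summary text."""
    return {s.input_id: s.text for s in summaries}


def attach_summaries(image_ids, summaries):
    """Return (image_id, summary_text) pairs; summary_text is None if missing."""
    lookup = summaries_by_id(summaries)
    return [(image_id, lookup.get(image_id)) for image_id in image_ids]
```

With a lookup like this, the order of the outputs no longer matters, and an image without a summary shows up as `None` instead of silently taking a neighbour's text.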

sanjaychelliah

Great addition! Left some comments!

sanjaychelliah · 2024-11-20T14:18:37Z

clarifai_datautils/multimodal/pipeline/summarizer.py

+    new_elements = []
+    for i, element in enumerate(resp.outputs):
+      summary = ""
+      if image_elements[i].text:


I believe image elements will not have text, so why this check here?

I observed that some image elements had text too. It can be seen in the output of the 9th cell in this notebook.
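
For context, a small sketch of one way existing element text and a generated summary could be combined. How the PR actually merges the two is not visible in the quoted snippet, so `merge_text` and its newline-joining behaviour are assumptions for illustration only.

```python
def merge_text(existing_text, summary):
    """Join non-empty pieces with a newline, skipping None or blank values."""
    parts = [p.strip() for p in (existing_text, summary) if p and p.strip()]
    return "\n".join(parts)
```

Handling `None` and blank strings in one place means the caller does not need a separate `if image_elements[i].text:` branch.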

sanjaychelliah · 2024-11-20T15:17:56Z

clarifai_datautils/multimodal/pipeline/summarizer.py

+  """ Summarizes image elements. """
+
+  def __init__(self,
+               pat,


PAT can be optional. If it is not passed, the Clarifai SDK itself checks the environment and returns an error if it is not set there.
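
A sketch of the fallback behaviour described above, written as a standalone helper. `resolve_pat` is hypothetical and only mirrors the described behaviour; it is not the SDK's own code. In the summarizer itself, defaulting `pat=None` and passing it through to the SDK would likely be enough.

```python
import os


def resolve_pat(pat=None, env=None):
    """Return the explicit PAT if given, else fall back to CLARIFAI_PAT."""
    # `env` is injectable so the lookup can be exercised without touching os.environ.
    env = os.environ if env is None else env
    token = pat or env.get("CLARIFAI_PAT")
    if not token:
        raise ValueError("No PAT passed and CLARIFAI_PAT is not set")
    return token
```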

sanjaychelliah

👍

tested summarizer elements

eb8d6e9

mansi-k requested a review from sanjaychelliah November 19, 2024 15:51

mansi-k added 5 commits November 20, 2024 17:26

added input id for images

2979c62

added input id for images

af46344

added input id for images

d52f5a6

test case

fe0d75e

test case

f7dd88a

abhash-ai reviewed Nov 20, 2024

sanjaychelliah reviewed Nov 20, 2024

sanjaychelliah requested a review from mogith-pn November 20, 2024 14:25

mansi-k added 3 commits November 20, 2024 20:06

test case

09c977c

addressed comments

c0cbfb4

addressed comments

9203227

sanjaychelliah reviewed Nov 20, 2024

addressed comments

38ccc91

mansi-k requested review from abhash-ai and sanjaychelliah November 20, 2024 16:03

addressed comments

e3a9f13

sanjaychelliah approved these changes Nov 21, 2024

abhash-ai approved these changes Nov 22, 2024

mansi-k merged commit 17a6230 into main Nov 22, 2024
8 checks passed

sanjaychelliah mentioned this pull request Dec 18, 2024

VERSION 0.0.6 #33

Merged
