Add support for image upload to multimodal models #59

andrewnguonly · 2024-02-03T04:56:08Z

Summary

This PR resolves #27.

llava and bakllava are multimodal models available through Ollama. With this change, images present on the current tab are downloaded and bound to the model.

Implementation

Move getHtmlContent() to a separate scripts/content.ts file.
Update getHtmlContent() to collect the src URLs of <img> elements found inside the elements matched by the selectors and selectorsAll queries.
Update the background script to download the images and bind the base64-encoded image data to the model.
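
For reviewers, the last two steps can be sketched roughly as follows. This is an illustrative sketch, not the code in this PR: collectImageUrls(), bytesToBase64(), downloadImageAsBase64(), and the ElementLike/ImgLike types are hypothetical names, and the structural types are minimal stand-ins for DOM elements so the sketch also runs outside a browser. It assumes fetch and btoa are available globally, as they are in extension scripts and recent Node versions.

```typescript
// Hypothetical stand-ins for the DOM types used by the content script.
interface ImgLike {
  src: string;
}
interface ElementLike {
  querySelectorAll(selector: string): ArrayLike<ImgLike>;
}

// Collect unique, non-empty src URLs from <img> elements nested inside
// the elements already matched by the selector queries.
function collectImageUrls(elements: ElementLike[]): string[] {
  const urls = new Set<string>();
  for (const element of elements) {
    for (const img of Array.from(element.querySelectorAll("img"))) {
      if (img.src) {
        urls.add(img.src);
      }
    }
  }
  return Array.from(urls);
}

// Encode raw bytes as a base64 string.
function bytesToBase64(bytes: Uint8Array): string {
  let binary = "";
  for (let i = 0; i < bytes.length; i++) {
    binary += String.fromCharCode(bytes[i]);
  }
  return btoa(binary);
}

// Download one image and return its base64-encoded data, or null if the
// request fails (assumed behavior; the PR's actual error handling may differ).
async function downloadImageAsBase64(url: string): Promise<string | null> {
  try {
    const response = await fetch(url);
    if (!response.ok) {
      return null;
    }
    const buffer = await response.arrayBuffer();
    return bytesToBase64(new Uint8Array(buffer));
  } catch {
    return null;
  }
}
```

Returning null on failure lets the caller skip images that cannot be fetched instead of aborting the whole prompt; whether the PR does this is not stated here.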

andrewnguonly added 7 commits February 2, 2024 14:42

Move getHtmlContent() to separate content script file.

8c1f5b9

Download images and bind image data to model.

ca99b4e

Update README. Move screenshots to screenshots directory.

aa789fb

Add error handling for fetch request.

353625c

Optimally download images based on the prompt.

f2c99af

Add manual override for image prompt classification.

f40d28c

Update README, comments, and docstrings.

67df582

andrewnguonly merged commit df9336c into 1.0.4 Feb 3, 2024

andrewnguonly deleted the multi-modal branch February 3, 2024 20:26
