Support multimodal use cases #6674

Closed
2 tasks
julian-risch opened this issue Jan 2, 2024 · 1 comment
Labels
2.x Related to Haystack v2.0 epic:abandoned Epic was abandoned and not finished epic

Comments

@julian-risch
Member

julian-risch commented Jan 2, 2024

Haystack 2.0 documents support more than just text data, but multimodal use cases require tailored components. This epic is about enabling one of those use cases and understanding what blocks the others. The focus should be on preprocessors, retrievers, and readers/generators for Table QA.

Table QA
Includes better preprocessing (for example, for tables split across multiple PDF pages), retrievers, and readers/generators
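
To make the split-table preprocessing idea concrete, here is a minimal sketch in plain Python. It is not a Haystack component or API: `merge_split_tables` is a hypothetical helper, and the heuristic it uses (a table continued on the next page repeats its header row) is an assumption that will not hold for every PDF.

```python
from typing import List

Row = List[str]
Table = List[Row]  # by convention here, the first row is the header


def merge_split_tables(fragments: List[Table]) -> List[Table]:
    """Merge consecutive table fragments that share a header row.

    Assumption (hypothetical heuristic): a table that continues on the
    next page repeats its header, so identical headers mean "same table".
    """
    merged: List[Table] = []
    for fragment in fragments:
        if not fragment:
            continue  # skip empty extraction results
        header, body = fragment[0], fragment[1:]
        if merged and merged[-1][0] == header:
            # Same header as the previous table: append only the data rows.
            merged[-1].extend(list(row) for row in body)
        else:
            # New table: copy the rows so the caller's input is not mutated.
            merged.append([list(header)] + [list(row) for row in body])
    return merged


# Fragments as they might come out of a per-page table extractor.
page_1 = [["city", "population"], ["Berlin", "3.6M"]]
page_2 = [["city", "population"], ["Hamburg", "1.8M"]]
page_3 = [["product", "price"], ["widget", "5"]]

tables = merge_split_tables([page_1, page_2, page_3])
```

A real preprocessor would likely also need to handle continuation pages that do not repeat the header, which this sketch deliberately ignores.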

Visual QA
Answering questions about the content of images in general using both visual and textual information

Image Captioning
Indexing images and creating captions for better retrieval

Speech-to-Text and Text-to-Speech
(in addition to our WhisperTranscriber components)

Tasks

  1. feature request integration:google-ai
  2. 2.x P3
@julian-risch julian-risch added 2.x Related to Haystack v2.0 epic labels Jan 2, 2024
@julian-risch
Member Author

Related issue about multimodal embedding #5943

@masci masci added the epic:abandoned Epic was abandoned and not finished label May 25, 2024
@masci masci closed this as completed May 25, 2024