Open
Description
Description of the new feature
The parser should detect and handle different versions of the same PDF document and generate high-quality metadata for a Retrieval-Augmented Generation (RAG) system. Extracting structured data from unstructured documents lets us filter results more effectively, which should improve retrieval accuracy.
Proposed technical implementation details
See this video (link). The example is based on LangExtract, with Gemini and Ollama models if required. See this link and image for more details.
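
As a rough illustration of the version handling described above, here is a minimal sketch of filtering retrieved chunks by metadata. It assumes each chunk already carries a `document_id` and an integer `version` produced by an upstream extraction step (for example an LLM-based extractor); the `ChunkMetadata` class and `latest_version_only` function are hypothetical names for this sketch, not part of LangExtract or the parser.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChunkMetadata:
    """Hypothetical metadata attached to each indexed chunk."""
    document_id: str  # assumed stable id shared by all versions of one PDF
    version: int      # assumed to be extracted upstream from the document
    text: str


def latest_version_only(chunks):
    """Keep only chunks that belong to the highest version of each document."""
    latest = {}
    for c in chunks:
        latest[c.document_id] = max(latest.get(c.document_id, c.version), c.version)
    return [c for c in chunks if c.version == latest[c.document_id]]


chunks = [
    ChunkMetadata("policy", 1, "Refunds within 14 days."),
    ChunkMetadata("policy", 2, "Refunds within 30 days."),
    ChunkMetadata("handbook", 1, "Office hours are 9 to 5."),
]
filtered = latest_version_only(chunks)
print([c.text for c in filtered])
```

Filtering on metadata before (or after) similarity search keeps outdated versions of a document out of the retrieved context. Real documents may need a richer version key than a single integer, such as a revision date.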

Metadata
Assignees
Labels
enhancement (New feature or request)