Skip to content

Document Reader Enhancements #296

@gotsysdba

Description

@gotsysdba

Checklist

  • I have searched the existing issues for similar feature requests.
  • I added a descriptive title and summary to this issue.

Summary

Broaden document processing and ingestion capabilities:

  • Add semantic chunking.
  • Complete web crawling feature (BeautifulSoup integration).
  • Explore directory/bucket “crawl” support.
  • Add Office docs and compressed file ingestion.
  • Consider Azure-based document services.
  • Integrate vector embedding with DB tables.

Why?

No response

How?

No response

Additional Context

No response

Sub-issues

Metadata

Metadata

Assignees

Labels

RoadmapUpcoming Roadmap Items

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions