[Document Understanding] Can we support a new task on document understanding? #218

@jlia0

Document Understanding

Some example models:

  1. DiT: https://huggingface.co/microsoft/dit-large
  2. LayoutLMv3: https://huggingface.co/microsoft/layoutlmv3-large
  3. Donut: https://huggingface.co/docs/transformers/model_doc/donut

Reason for request

Document understanding is a very popular task, but I couldn't find any support for it in the web environment.

Some tasks include:

  1. Key Information Extraction (KIE)
  2. Document Layout Analysis (DLA)
  3. Document Question Answering (DQA)
  4. Optical Character Recognition (OCR)
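
To make the four tasks above more concrete, here is a rough sketch of what each one takes as input and produces as output. Every type and function name below is hypothetical and made up for illustration; none of it comes from an existing library, and the toy `extract_key_info` is a rule-based stand-in for what would really be a model such as LayoutLMv3 or Donut.

```python
from dataclasses import dataclass

# Hypothetical types for illustration only; not any library's API.


@dataclass
class Word:
    text: str
    box: tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels


@dataclass
class Region:
    label: str  # e.g. "title", "table", "paragraph"
    box: tuple[int, int, int, int]


# Rough task shapes (what goes in, what comes out):
#   OCR: image                   -> list[Word]
#   DLA: image                   -> list[Region]
#   KIE: list[Word]              -> dict[str, str]
#   DQA: (list[Word], question)  -> answer string


def extract_key_info(words: list[Word], keys: list[str]) -> dict[str, str]:
    """Toy KIE stand-in: pair each "Key:" token with the word that follows it."""
    found = {}
    for current, following in zip(words, words[1:]):
        name = current.text.rstrip(":")
        if current.text.endswith(":") and name in keys:
            found[name] = following.text
    return found


# Pretend these words came out of an OCR step on an invoice image.
words = [
    Word("Invoice:", (10, 10, 80, 30)),
    Word("A-1042", (90, 10, 150, 30)),
    Word("Total:", (10, 40, 60, 60)),
    Word("$35.00", (70, 40, 130, 60)),
]
print(extract_key_info(words, ["Invoice", "Total"]))
```

The point of the sketch is only that the tasks chain naturally: OCR output (words with bounding boxes) is the usual input to KIE and DQA, while DLA works on the page image directly.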

Labels: enhancement
