Fine-tuning-using-Hugging-face-transformers

The abstract of the LayoutLMv3 paper reads:

Self-supervised pre-training techniques have achieved remarkable progress in Document AI. Most multimodal pre-trained models use a masked language modeling objective to learn bidirectional representations on the text modality, but they differ in pre-training objectives for the image modality. This discrepancy adds difficulty to multimodal representation learning. In this paper, we propose LayoutLMv3 to pre-train multimodal Transformers for Document AI with unified text and image masking. Additionally, LayoutLMv3 is pre-trained with a word-patch alignment objective to learn cross-modal alignment by predicting whether the corresponding image patch of a text word is masked. The simple unified architecture and training objectives make LayoutLMv3 a general-purpose pre-trained model for both text-centric and image-centric Document AI tasks. Experimental results show that LayoutLMv3 achieves state-of-the-art performance not only in text-centric tasks, including form understanding, receipt understanding, and document visual question answering, but also in image-centric tasks such as document image classification and document layout analysis.
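The word-patch alignment objective mentioned in the abstract can be illustrated with a small labeling sketch. This is not code from the paper or from the notebook in this repository. The helper names (`patches_for_box`, `wpa_labels`), the 224-pixel image with 16-pixel patches, and the rule that a word counts as unaligned when any patch overlapping its bounding box is masked are all assumptions made for illustration. Words whose text is itself masked get no label here, on the assumption that they are left out of the alignment loss.

```python
from typing import List, Optional, Set, Tuple

# (x0, y0, x1, y1) in pixels; x1 and y1 are exclusive. Hypothetical convention.
Box = Tuple[int, int, int, int]


def patches_for_box(box: Box, patch_size: int = 16, grid: int = 14) -> Set[int]:
    """Return the flat indices of all image patches that a word box overlaps.

    Assumes a square image split into grid x grid patches (14 x 14 for a
    224-pixel image with 16-pixel patches), numbered row by row.
    """
    x0, y0, x1, y1 = box
    col0 = max(0, x0 // patch_size)
    col1 = min(grid - 1, (x1 - 1) // patch_size)
    row0 = max(0, y0 // patch_size)
    row1 = min(grid - 1, (y1 - 1) // patch_size)
    return {
        row * grid + col
        for row in range(row0, row1 + 1)
        for col in range(col0, col1 + 1)
    }


def wpa_labels(
    word_boxes: List[Box],
    masked_words: Set[int],
    masked_patches: Set[int],
) -> List[Optional[int]]:
    """Assign 1 (aligned), 0 (unaligned), or None (excluded) to each word."""
    labels: List[Optional[int]] = []
    for index, box in enumerate(word_boxes):
        if index in masked_words:
            # Masked text tokens are skipped (assumed excluded from the loss).
            labels.append(None)
        elif patches_for_box(box) & masked_patches:
            # At least one overlapping image patch is masked.
            labels.append(0)
        else:
            labels.append(1)
    return labels


word_boxes = [(0, 0, 30, 10), (40, 0, 60, 10), (100, 100, 120, 110)]
labels = wpa_labels(word_boxes, masked_words={2}, masked_patches={3, 50})
print(labels)  # [1, 0, None]
```

In this toy input the first word overlaps patches 0 and 1, neither of which is masked, so it is labeled aligned. The second word overlaps patch 3, which is masked, so it is labeled unaligned. The third word is itself masked and receives no label.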

Folders and files

README.md
fine-tune-layoutlmv3-on-funsd.ipynb
