Demo notebook for LayoutLMForSequenceClassification #287

NielsRogge · 2021-01-13T09:51:58Z

Hey there,

I've recently improved LayoutLM in the HuggingFace Transformers library by adding some more documentation + code examples, a demo notebook that illustrates how to fine-tune LayoutLMForTokenClassification on the FUNSD dataset, some integration tests that verify whether the implementation in HuggingFace Transformers gives the same output tensors on the same input data as the original implementation, and finally LayoutLMForSequenceClassification. My PR was merged yesterday :)

However, now I'm also preparing a notebook that illustrates how to fine-tune LayoutLMForSequenceClassification on (a small subset of) the RVL-CDIP dataset. However, it doesn't seem to be able to overfit the tiny subset (I have 16 images per class, so as there are 16 labels I have 256 training examples). You can run it here: https://colab.research.google.com/drive/1DUpTi2aL64AuIJ_9g6dGgKfltEEFqQbt?usp=sharing

Any feedback is greatly appreciated!

The text was updated successfully, but these errors were encountered:

NielsRogge · 2021-01-14T15:40:36Z

Btw, the demo notebook for fine-tuning LayoutLMForTokenClassification on the FUNSD dataset can be found here.

aritzLizoain · 2021-02-25T15:41:59Z

Hi @NielsRogge, thanks for providing the notebooks!

I am working with your demo notebook for fine-tuning LayoutLMForTokenClassification. How can we save the fine-tuned model in order to use it in for inference in the future? I don't see any output file after fine-tuning.

Thank you in advance!

NielsRogge · 2021-02-25T15:49:10Z

Hi! In HuggingFace, a model can be saved using model.save_pretrained("name-of-your-directory"). This will save both the weights (pytorch_model.bin file), as well as the configuration (config.json) to the directory.

aritzLizoain · 2021-02-25T16:11:08Z

Thank you for your prompt reply!

monuminu · 2021-04-17T12:29:30Z

@NielsRogge Cant thank you enough . It really helped . I took your code and implemented without installing unlim basically pure transformers . Would love to add to your repo my notebook .

VishnuGopireddy · 2021-04-17T12:45:50Z

@NielsRogge I am getting "PicklingError: Can't pickle <class 'layoutlm.data.funsd.InputFeatures'>: import of module 'layoutlm.data.funsd' failed" while preparing a dataloader for FUNSD dataset. Can you please help?

NielsRogge · 2021-04-17T12:48:45Z

Hi @monuminu @VishnuGopireddy I have a new notebook that adds visual features from a Resnet-101 backbone in addition to the text + layout features. You can find it here: https://github.com/NielsRogge/Transformers-Tutorials/blob/master/LayoutLM/Add_image_embeddings_to_LayoutLM.ipynb

It relies entirely on HuggingFace Transformers, no need for the unilm repo anymore :)

VishnuGopireddy · 2021-04-17T13:05:26Z

@NielsRogge Awesome!!! Woking fine. Big thanks.

monuminu · 2021-04-18T13:13:19Z

@NielsRogge amazing work !

VishnuGopireddy · 2021-04-20T02:53:58Z

Hi @NielsRogge Nice work, I am able to get all the tags for each word. Is there any way/ approach to get correspondence between tags? I mean mapping question to the answer. Thanks...

nkrot · 2021-06-11T09:05:46Z

Hi @NielsRogge,
Thanks for the notebook, it is very instructive!
And it would be even more useful if it contained an example showing how to use the model fine-tuned.

vinayakk1094 · 2021-09-28T08:22:24Z

The link seems to be broken - 'Sorry, the file you have requested does not exist.'

NielsRogge · 2021-09-28T08:23:42Z

@vinayakk1094 hi, all tutorials can be found here (both for LayoutLM and LayoutLMv2): https://github.com/NielsRogge/Transformers-Tutorials

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Demo notebook for LayoutLMForSequenceClassification #287

Demo notebook for LayoutLMForSequenceClassification #287

NielsRogge commented Jan 13, 2021 •

edited

NielsRogge commented Jan 14, 2021 •

edited

aritzLizoain commented Feb 25, 2021

NielsRogge commented Feb 25, 2021

aritzLizoain commented Feb 25, 2021

monuminu commented Apr 17, 2021

VishnuGopireddy commented Apr 17, 2021

NielsRogge commented Apr 17, 2021

VishnuGopireddy commented Apr 17, 2021

monuminu commented Apr 18, 2021

VishnuGopireddy commented Apr 20, 2021

nkrot commented Jun 11, 2021

vinayakk1094 commented Sep 28, 2021

NielsRogge commented Sep 28, 2021

Demo notebook for LayoutLMForSequenceClassification #287

Demo notebook for LayoutLMForSequenceClassification #287

Comments

NielsRogge commented Jan 13, 2021 • edited

NielsRogge commented Jan 14, 2021 • edited

aritzLizoain commented Feb 25, 2021

NielsRogge commented Feb 25, 2021

aritzLizoain commented Feb 25, 2021

monuminu commented Apr 17, 2021

VishnuGopireddy commented Apr 17, 2021

NielsRogge commented Apr 17, 2021

VishnuGopireddy commented Apr 17, 2021

monuminu commented Apr 18, 2021

VishnuGopireddy commented Apr 20, 2021

nkrot commented Jun 11, 2021

vinayakk1094 commented Sep 28, 2021

NielsRogge commented Sep 28, 2021

NielsRogge commented Jan 13, 2021 •

edited

NielsRogge commented Jan 14, 2021 •

edited