
Differences between original implementation and HuggingFace implementation #9228

Closed · osabnis opened this issue Dec 21, 2020 · 2 comments

osabnis commented Dec 21, 2020

Environment info

  • transformers version: 4.0.0
  • Platform: Windows
  • Python version: 3.6.5
  • PyTorch version (GPU?): 1.6.0+cu101
  • Tensorflow version (GPU?): -
  • Using GPU in script?: Yes
  • Using distributed or parallel set-up in script?: No

Who can help

@stefan-it

Information

The model I am using (Bert, XLNet ...): LayoutLMForTokenClassification

The problem arises when using:

  • the official example scripts: (give details below)
  • my own modified scripts: my own modified script

The task I am working on is:

  • an official GLUE/SQuAD task: (give the name)
  • my own task or dataset: my own dataset

To reproduce

Steps to reproduce the behavior:

Expected behavior

This is more of a question than an issue.
When I trained the LayoutLM model on my data using the token classification model from HuggingFace, I saw a small drop in performance compared to the original implementation. I wanted to ask whether there are any differences between the two models; I have kept the hyper-parameters exactly the same in both cases.
Two key points where I found differences:
(1) When reading in the dataset, the Microsoft version has a concept called "segment_ids", which is not a parameter in the HuggingFace LayoutLM documentation.
(2) When I loaded both models and printed their layers, I saw one extra entry called layoutlm.embeddings.position_ids in the HuggingFace implementation (see the inspection sketch right after this list).
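Roughly the check I ran on the HuggingFace side, as a minimal sketch (the checkpoint name and the num_labels value are just illustrative):

```python
from transformers import LayoutLMForTokenClassification

# Load the HuggingFace implementation (checkpoint name and num_labels are illustrative)
model = LayoutLMForTokenClassification.from_pretrained(
    "microsoft/layoutlm-base-uncased", num_labels=13
)

# Keys in the state dict that mention position_ids
print([k for k in model.state_dict() if "position_ids" in k])
# -> ['layoutlm.embeddings.position_ids']

# Check whether that entry is actually a trainable parameter
print([n for n, _ in model.named_parameters() if "position_ids" in n])
# -> [] (it only appears in the state dict, not among the parameters)
```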

I am trying to find the reason for the drop in performance, so I wanted to check whether there is any difference between the model implementations themselves. It would be a great help if you could explain the two differences I found!

Thanks!

NielsRogge commented Jan 4, 2021

Hi there,

I made some integration tests for both the base model (LayoutLM) and the model with a token classification head on top (LayoutLMForTokenClassification). These integration tests do not reveal any difference in output between the original implementation and the one in HuggingFace Transformers on the same input data, so the implementation seems to be OK. By the way, the segment_ids you are referring to are called token_type_ids in the Transformers library.
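For completeness, here is a minimal sketch of a forward pass (the checkpoint name, num_labels, and the dummy inputs are purely illustrative); whatever the original code builds as segment_ids is simply passed as token_type_ids:

```python
import torch
from transformers import LayoutLMTokenizer, LayoutLMForTokenClassification

tokenizer = LayoutLMTokenizer.from_pretrained("microsoft/layoutlm-base-uncased")
model = LayoutLMForTokenClassification.from_pretrained(
    "microsoft/layoutlm-base-uncased", num_labels=2
)

encoding = tokenizer("a tiny example", return_tensors="pt")
seq_len = encoding["input_ids"].shape[1]
# LayoutLM also expects one normalized (0-1000) bounding box per token;
# all-zero boxes are used here purely for illustration
bbox = torch.zeros((1, seq_len, 4), dtype=torch.long)

outputs = model(
    input_ids=encoding["input_ids"],
    bbox=bbox,
    attention_mask=encoding["attention_mask"],
    token_type_ids=encoding["token_type_ids"],  # the "segment_ids" of the original repo
)
print(outputs.logits.shape)  # (1, seq_len, num_labels)
```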

I also made a demo notebook that showcases how to fine-tune LayoutLMForTokenClassification on the FUNSD dataset; I'm getting quite good performance even though I'm not using Mask R-CNN features. Let me know if this helps you.

NielsRogge mentioned this issue Jan 8, 2021

github-actions bot commented Mar 6, 2021

This issue has been automatically marked as stale and closed because it has not had recent activity. Thank you for your contributions.

If you think this still needs to be addressed please comment on this thread.
