Knowing Where and What: Unified Word Block Pretraining for Document Understanding
Our code is based on BROS.
| name | # params |
|---|---|
| utel-base-uncased | 110M |
| utel-large-uncased | 340M |
We conducted the FUNSD EE experiment on the FUNSD data preprocessed in LayoutLM; the original code can be found in this link. To run it, follow the steps below:
- Move to `preprocess/funsd/`.
- Run `bash preprocess.sh`.
- Run `preprocess_2nd.py`. This script converts the data preprocessed in LayoutLM to fit this repo.

Data will be created in `datasets/funsd/`.
Run the command below:

```shell
CUDA_VISIBLE_DEVICES=0 python train.py --config=configs/finetune_funsd_ee_bies.yaml
```
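The `bies` in the config name refers to the BIES tagging scheme (Begin/Inside/End/Single) commonly used for entity extraction as sequence labeling. As a minimal sketch of the idea (the helper and the example entities below are illustrative, not taken from this repo's code):

```python
def to_bies(entity_words):
    """Expand the words of one entity span into BIES tags:
    a single-word entity gets S; a multi-word entity gets
    B for the first word, E for the last, and I in between."""
    n = len(entity_words)
    if n == 1:
        return ["S"]
    return ["B"] + ["I"] * (n - 2) + ["E"]

# Illustrative entity spans from a form-like document.
print(to_bies(["Date:"]))                # → ['S']
print(to_bies(["John", "A.", "Smith"]))  # → ['B', 'I', 'E']
```

At fine-tuning time, each word (or word block) is classified into one of these positional tags per entity type, and contiguous tag runs are decoded back into entity spans.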