Table-Recognition-base-on-Transformer-Decoder

An end to end model for two sub-tasks of Table Recognition: table structure recognition, cell detection

Dataset: Pubtabnet

Consists one of Shared Encoder, one Shared Decoder and three separate Decoder for three sub-tasks
- Shared Encoder using a CNN backbone network as the feature extractor
- Four Decoders are inspired by original Transformer decoder

config.py contains hyperparameters
parsing_data.py match raw data from Pubtabnet to anotation
tokenizer.py encode characters, html tags
sub_module.py build necessary sub-modules like Cross Attention, Self Attention, Positional Encoding, ...
main_model build last model from sub-modules
train_infer.py train loop

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
config.py		config.py
custom_dataset.py		custom_dataset.py
main_model.py		main_model.py
parsing_data.py		parsing_data.py
requirement.txt		requirement.txt
sub_modules.py		sub_modules.py
tokenizer.py		tokenizer.py
train_infer.py		train_infer.py