V-Doc : Visual questions answers with Documents

This repository contains code for the paper V-Doc : Visual questions answers with Documents. The videos can be accessed by this link.

Ding, Y., Huang, Z., Wang, R., Zhang, Y., Chen, X., Ma, Y., Chung, H., & Han, C. (CVPR 2022)
V-Doc : Visual questions answers with Documents

Dataset in Dataset Storage Module

The dataset we used to trained the model is provided in following links:

PubVQA Dataset for training Mac-Network.

Dataset for training LayoutLMv2(FUNSD-QA).

Dataset Generation

To run the scene based question generation code, we need to fetch the JSON files from the source.

Extract OCR information

python3 ./document_collection.py

After the step above, a new folder called ./input_ocr will be generated.

Generate questions

python3 ./scene_based/pdf_generate_question.py

To limit the number of generated questions, you can change the code in pdf_generate_question.py line 575 and line 591-596

After the steps above, you can see a json file under the ./output_qa_dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
images		images
ocr_json_files		ocr_json_files
scene_based_templates		scene_based_templates
README.md		README.md
document_collection.py		document_collection.py
pdf_generate_question.py		pdf_generate_question.py
pdf_question_engine.py		pdf_question_engine.py
synonyms.json		synonyms.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

images

images

ocr_json_files

ocr_json_files

scene_based_templates

scene_based_templates

README.md

README.md

document_collection.py

document_collection.py

pdf_generate_question.py

pdf_generate_question.py

pdf_question_engine.py

pdf_question_engine.py

synonyms.json

synonyms.json

Repository files navigation

V-Doc : Visual questions answers with Documents

Ding, Y., Huang, Z., Wang, R., Zhang, Y., Chen, X., Ma, Y., Chung, H., & Han, C. (CVPR 2022)
V-Doc : Visual questions answers with Documents

Dataset in Dataset Storage Module

Dataset Generation

Extract OCR information

Generate questions

About

Releases

Packages

Contributors 4

Languages

usydnlp/vdoc

Folders and files

Latest commit

History

Repository files navigation

V-Doc : Visual questions answers with Documents

Ding, Y.*, Huang, Z.*, Wang, R., Zhang, Y., Chen, X., Ma, Y., Chung, H., & Han, C. (CVPR 2022) V-Doc : Visual questions answers with Documents

Dataset in Dataset Storage Module

Dataset Generation

Extract OCR information

Generate questions

About

Resources

Stars

Watchers

Forks

Languages

Ding, Y., Huang, Z., Wang, R., Zhang, Y., Chen, X., Ma, Y., Chung, H., & Han, C. (CVPR 2022)
V-Doc : Visual questions answers with Documents