[Post 0.7] [DocClassifier] PDF Document Classification #2864

+            for per_doc in docs:
+                # If there is non-pdf document, return False
+                if not per_doc.endswith(".pdf"):
+                    return False


Just curious about a question. Do we have some logics to handle the situation in which a non-PDF document is encountered? Are there any warnings return to the users?

github-actions · 2023-03-04T06:02:58Z

Job PR-2864-77eb107 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2864/77eb107/index.html

Ubuntu and others added 8 commits February 3, 2023 22:38

add missing documents support and exception handling

4ec046a

remove dependence on lightning cloud_io

ab6321d

add zero bbox

670c5e2

Merge branch 'autogluon:master' into doc_class

109b189

Update cloud_io.py

169c234

Merge branch 'autogluon:master' into doc_class

932ef1e

PDF document classification

052f837

Merge branch 'autogluon:master' into doc_class

dcc6f53

cheungdaven added the model list checked You have updated the model list after modifying multimodal unit tests/docs label Feb 8, 2023

cheungdaven changed the title ~~[WIP] PDF Document Classification~~ [WIP] [post 0.7] PDF Document Classification Feb 8, 2023

Ubuntu and others added 3 commits February 8, 2023 18:08

fix ci issue

109faed

change config in pdf-2-image conversion

54482ce

Merge branch 'autogluon:master' into doc_class

42a4d00

cheungdaven and others added 2 commits February 10, 2023 15:03

Merge branch 'autogluon:master' into doc_class

fcf6402

add tutorial for PDF doc classification

7fd93ba

cheungdaven changed the title ~~[WIP] [post 0.7] PDF Document Classification~~ [DocClassifier] [post 0.7] PDF Document Classification Feb 10, 2023

cheungdaven requested a review from zhiqiangdon February 10, 2023 23:15

Update pdf_classification.md

1293466

Update pdf_classification.md

d554c5b

Ubuntu and others added 4 commits February 11, 2023 05:33

fix pdf visualization

b2fd5e9

Update pdf_classification.md

1359c28

Update pdf_classification.md

025392e

Update pdf_classification.md

9a70202

cheungdaven requested review from sxjscience and yzhliu February 13, 2023 18:51

Merge branch 'master' into doc_class

1ec7ce9

zhiqiangdon reviewed Mar 1, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/data/infer_types.py Show resolved Hide resolved

zhiqiangdon reviewed Mar 1, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/data/infer_types.py Outdated Show resolved Hide resolved

zhiqiangdon reviewed Mar 1, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/data/infer_types.py Outdated Show resolved Hide resolved

zhiqiangdon reviewed Mar 1, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/data/process_document.py Show resolved Hide resolved

cheungdaven and others added 2 commits March 3, 2023 11:35

Merge branch 'autogluon:master' into doc_class

62785c1

address feedback

dc9d81a

cheungdaven requested review from suzhoum, tonyhoo and yongxinw March 3, 2023 22:18

Ubuntu and others added 4 commits March 3, 2023 22:54

remove tutorial

8968a8a

Update index.rst

d84ad97

Update index.rst

98ba24a

Update index.rst

8287e89

zhiqiangdon reviewed Mar 4, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/predictor.py Outdated Show resolved Hide resolved

Update predictor.py

8c3bf1b

zhiqiangdon reviewed Mar 4, 2023

View reviewed changes

multimodal/src/autogluon/multimodal/predictor.py Outdated Show resolved Hide resolved

Update predictor.py

e85a701

zhiqiangdon approved these changes Mar 4, 2023

View reviewed changes

Update predictor.py

1dac925

Update predictor.py

77eb107

Harry-zzh reviewed Mar 4, 2023

View reviewed changes

cheungdaven merged commit d071463 into autogluon:master Mar 6, 2023

cheungdaven deleted the doc_class branch March 15, 2023 17:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Post 0.7] [DocClassifier] PDF Document Classification #2864

[Post 0.7] [DocClassifier] PDF Document Classification #2864

cheungdaven commented Feb 8, 2023 •

edited

github-actions bot commented Feb 9, 2023

github-actions bot commented Feb 11, 2023

github-actions bot commented Feb 11, 2023

github-actions bot commented Feb 11, 2023

github-actions bot commented Mar 1, 2023

github-actions bot commented Mar 3, 2023

github-actions bot commented Mar 4, 2023

github-actions bot commented Mar 4, 2023

github-actions bot commented Mar 4, 2023

github-actions bot commented Mar 4, 2023

Harry-zzh Mar 4, 2023

github-actions bot commented Mar 4, 2023

[Post 0.7] [DocClassifier] PDF Document Classification #2864

[Post 0.7] [DocClassifier] PDF Document Classification #2864

Conversation

cheungdaven commented Feb 8, 2023 • edited

github-actions bot commented Feb 9, 2023

github-actions bot commented Feb 11, 2023

github-actions bot commented Feb 11, 2023

github-actions bot commented Feb 11, 2023

github-actions bot commented Mar 1, 2023

github-actions bot commented Mar 3, 2023

github-actions bot commented Mar 4, 2023

github-actions bot commented Mar 4, 2023

github-actions bot commented Mar 4, 2023

github-actions bot commented Mar 4, 2023

Harry-zzh Mar 4, 2023

Choose a reason for hiding this comment

github-actions bot commented Mar 4, 2023

cheungdaven commented Feb 8, 2023 •

edited