Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generation of training data for tables and figures outputs less element than in input #694

Closed

Conversation

lfoppiano
Copy link
Collaborator

@lfoppiano lfoppiano commented Jan 14, 2021

Attempt to fix #693, and is valid for both figures and tables.
One case is when only one figure/table is identified in the fulltext, it is not output in the respective training data.

The fix attempt to verify that there are not figure/tables accumulated but not consumed

NOTE: for reproducibility reasons, this fix has been merged from features/add-training-data, therefore I advice just cherry-pick the two commits that actually make this fix

lfoppiano and others added 30 commits October 23, 2020 09:43
@lfoppiano lfoppiano marked this pull request as draft January 14, 2021 03:50
@lfoppiano lfoppiano changed the title add missing figures when only one figure is present [wip] add missing figures when only one figure is present Jan 14, 2021
@coveralls
Copy link

coveralls commented Jan 14, 2021

Coverage Status

Coverage increased (+1.02%) to 39.179% when pulling 2aec236 on bugfix/fix-figure-table-training-data-generation into 5d2d814 on master.

@lfoppiano lfoppiano changed the title [wip] add missing figures when only one figure is present generation of training data for tables and figures outputs less element than in input Jan 27, 2021
@lfoppiano lfoppiano marked this pull request as ready for review January 27, 2021 02:37
@de-code
Copy link
Collaborator

de-code commented Jan 27, 2021

I guess because of the merge, GitHub seems to be confused about the files (which makes it impossible to review at least within GitHub).
What do you think about creating a separate branch and PR with just the three commits you mentioned?

@lfoppiano lfoppiano closed this Jan 27, 2021
@lfoppiano
Copy link
Collaborator Author

I did that in #702 ... hopefully I did not forget anything!

@lfoppiano lfoppiano deleted the bugfix/fix-figure-table-training-data-generation branch June 12, 2023 21:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Create training - missing figure/table training data as compared with the upstream model (fulltext)
3 participants