feat(docs/GPT-4 Vision Model): Add document on extracting tables from images and related functionality #443

Merged
merged 5 commits into from
Feb 20, 2024

Conversation

jxnl
Owner

@jxnl jxnl commented Feb 18, 2024

Ellipsis 🚀 This PR description was created by Ellipsis for commit ae9615e.

Summary:

This PR adds a new document on extracting tables from images using GPT-4 Vision Model, introduces new classes and functions for handling the extracted data in the new document, and updates the site navigation accordingly.

Key points:

  • Added a new document tables_from_vision.md in docs/hub.
  • Introduced MarkdownDataFrame, Table, and MultipleTables classes in the new document.
  • Added extract function for interacting with GPT-4 model in the new document.
  • Updated mkdocs.yml to include the new document in the navigation.

Generated with ❤️ by ellipsis.dev

@ellipsis-dev ellipsis-dev bot changed the title ... feat(docs/GPT-4 Vision Model): Add document on extracting tables from images and related functionality Feb 18, 2024
Contributor

@ellipsis-dev ellipsis-dev bot left a comment

No problems found on commit f17bc98


Something look wrong? You can customize Ellipsis by editing the ellipsis.yaml for this repository.

Contributor

@ellipsis-dev ellipsis-dev bot left a comment

Looks good to me! Incremental review on commit c3131f4

Reviewed 12 lines of code across 1 files in 32 second(s).

See details
  • Skipped files: 0
  • Confidence threshold: 85%
  • Drafted 1 additional comments.
  • Workflow ID: wflow_nUKKGGZA71CWJBcR
View 1 draft comments

These comments were drafted by Ellipsis, but were filtered out of the final review. They're included here so you can see our internal thought process and help you configure your ellipsis.yaml.

Drafted 1 comments under confidence threshold

Filtered comment at mkdocs.yml:171

Notes: The PR author has added a new document to the docs/hub directory and updated the mkdocs.yml file to include this new document in the navigation. This is the correct way to add a new document to the documentation. I don't see any issues with this change.

The new document has been correctly added to the mkdocs.yml file. Good job!

Confidence changes required: 0%


Contributor

ellipsis-dev bot commented Feb 18, 2024

Skipped PR review on 10dd869 because no changed files had a supported extension. If you think this was in error, please contact us and we'll fix it right away.



@jxnl jxnl merged commit ae9615e into main Feb 20, 2024
11 of 12 checks passed
@jxnl jxnl deleted the hub-vision-table branch February 20, 2024 02:46
Contributor

@ellipsis-dev ellipsis-dev bot left a comment

Looks good to me! Incremental review on commit ae9615e

Reviewed 186 lines of code across 2 files in 53 second(s).

See details
  • Skipped files: 2 (please contact us to request support for these files): poetry.lock, pyproject.toml
  • Confidence threshold: 85%
  • Drafted 1 additional comments.
  • Workflow ID: wflow_g6KAXWbWK4fATBCi
View 1 draft comments

These comments were drafted by Ellipsis, but were filtered out of the final review. They're included here so you can see our internal thought process and help you configure your ellipsis.yaml.

Drafted 1 comments under confidence threshold

Filtered comment at docs/hub/tables_from_vision.md:35

Notes: The new document tables_from_vision.md is added to the docs/hub directory, and mkdocs.yml is updated to include it in the site navigation. The code in the new document appears well structured. However, there is a potential issue with the md_to_df function, which converts a markdown table into a pandas DataFrame: it assumes that the first column of the table should be used as the DataFrame index. That is not always the case and could lead to incorrect results when the first column is not a suitable index. It would be better to drop this assumption and let pandas assign the index automatically.

The md_to_df function assumes that the first column of the table will be used as the index of the DataFrame. This might not always be the case and could lead to incorrect results if the first column is not suitable to be used as an index. Consider removing the index_col=1 argument to let pandas choose the index automatically.

pd.read_csv(StringIO(data), sep='|').dropna(axis=1, how='all').iloc[1:].applymap(lambda x: x.strip())

Confidence changes required: 80%
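
For readers who want to try the suggestion above, here is a minimal sketch of an md_to_df-style helper that leaves the index at the pandas default. It is an illustration, not the code merged in this PR: the function body, the sample table, and the use of a column-wise .str.strip() (chosen instead of applymap, which newer pandas releases deprecate) are all assumptions.

```python
from io import StringIO

import pandas as pd


def md_to_df(markdown: str) -> pd.DataFrame:
    """Parse a pipe-delimited markdown table into a DataFrame (hypothetical sketch)."""
    df = (
        # Read every cell as text; no index_col, so pandas assigns a default index.
        pd.read_csv(StringIO(markdown), sep="|", dtype=str)
        # The leading and trailing pipes produce two all-empty columns; drop them.
        .dropna(axis=1, how="all")
        # The first data row is the |---|---| separator; drop it.
        .iloc[1:]
    )
    # Strip the padding around header names and cell values.
    df.columns = [name.strip() for name in df.columns]
    df = df.apply(lambda col: col.str.strip())
    return df.reset_index(drop=True)


# Made-up sample table, used only to exercise the helper.
table = """
| Name  | Age |
|-------|-----|
| Alice | 30  |
| Bob   | 25  |
"""

df = md_to_df(table.strip())
print(df)
```

With no index_col, every column of the table stays a regular column and the rows are numbered from 0, so a caller who does want a particular column as the index can opt in afterwards with set_index.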


Something look wrong? Tag @ellipsis-dev in a comment, or customize the ellipsis.yaml for this repository.


1 participant