Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add text extraction? #5

Open
oaustegard opened this issue Jun 24, 2024 · 4 comments
Open

Add text extraction? #5

oaustegard opened this issue Jun 24, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@oaustegard
Copy link

While this is really neat and a quick shortcut, would be even better if you could add text extractor, since the OCR of Claude seems to not capture all the text of a screenshot. Today I use a bookmarklet for this (converting page to Markdown using turndown.js), but an extension like yours would definitely be more convenient

@polywock polywock added enhancement New feature or request labels Jun 24, 2024
@polywock
Copy link
Owner

Hello. Seems like a useful feature. How about a mode that includes document.body.innerHTML as an attached file? Is conversion to markdown necessary?

@oaustegard
Copy link
Author

oaustegard commented Jun 25, 2024 via email

@polywock
Copy link
Owner

polywock commented Jun 26, 2024

I was worried that including Turndown will balloon the extension's size too much, but it's very small.

In this update, I've included a new Mode called "Page data". Assuming you're on Chrome/Edge, you can try it out by...

  1. Extract the packed.zip into folder.
  2. Go to chrome://extensions
  3. Enabling Developer mode
  4. Click "Load unpacked" and load the extracted folder.

packed.zip

A few issues

  • Sometimes the page data is too large and doesn't fit Claude's context limit. How do you get around this?
  • Claude's website in general is very finicky, sometimes there's error messages like "format not supported".

@oaustegard
Copy link
Author

oaustegard commented Jun 26, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants