feat: choose multiple clusters if necessary #2

Merged
Saghen merged 1 commit into feat/websearch-parsing from feat/parsing-multiple-clusters on May 6, 2024

Aaditya-Sahay
Member

No description provided.

@Aaditya-Sahay Aaditya-Sahay force-pushed the feat/parsing-multiple-clusters branch from 8b1694a to 6043721 Compare May 4, 2024 15:56
@Saghen Saghen merged commit 26e4ff6 into feat/websearch-parsing May 6, 2024
2 of 3 checks passed
@Saghen Saghen deleted the feat/parsing-multiple-clusters branch May 6, 2024 15:58
Saghen added a commit that referenced this pull request May 13, 2024
* feat: playwright, spatial parsing, markdown for web search

Co-authored-by: Aaditya Sahay <aadityasahay1@gmail.com>

* feat: choose multiple clusters if necessary (#2)

* chore: resolve linting failures

* feat: improve parsing performance and error messages

* feat: combine embeddable chunks together on cpu

* feat: reduce parsed pages from 10 to 8

* feat: disable javascript in playwright by default

* feat: embedding and parsing error messages

* feat: move isURL, fix type errors, misc

* feat: misc cleanup

* feat: change serializedHtmlElement to interface

* fix: isUrl filename

* fix: add playwright dependencies to docker

* feat: add playwright browsers to docker image

* feat: enable javascript by default

* feat: remove error message from console on failed page

---------

Co-authored-by: Aaditya Sahay <aadityasahay1@gmail.com>
Co-authored-by: Aaditya Sahay <56438732+Aaditya-Sahay@users.noreply.github.com>
2 participants