langchain[minor]: Make EmbeddingsFilters async #22737

pprados · 2024-06-10T13:55:42Z

Thank you for contributing to LangChain!

PR title: community: EmbeddingsFilters is not compatible with async
PR message:
- Description: EmbeddingsFilters is not compatible with async
- Issue: Current implementation call sync methods
- Twitter handle: pprados
Add tests and docs: Test were updated
Lint and test: Run make format, make lint and make test from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/

@eyurtsev, another review ?

vercel · 2024-06-10T13:55:46Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)	Visit Preview		Jun 12, 2024 10:50am

pprados · 2024-06-10T14:20:40Z

@eyurtsev
Once again, an async bug

eyurtsev

Looks good -- could you undo changes in pyproject.toml and poetry.lock -- most PRs should leave those unomdified

pyproject.toml

libs/community/tests/integration_tests/retrievers/test_contextual_compression.py

.../community/tests/integration_tests/retrievers/document_compressors/test_embeddings_filter.py

libs/community/tests/integration_tests/retrievers/document_compressors/test_base.py

…dings_filter

pprados · 2024-06-11T07:26:09Z

Hello @eyurtsev

In test_embeddings_filter.py, I change
from langchain_community.embeddings import OpenAIEmbeddings
to
from langchain_openai.embeddings import OpenAIEmbeddings

But, for that, I need to add a dependency in pyproject.tml

[tool.poetry.group.test_integration.dependencies]
...
langchain-openai = ">=0.1.8"

But, the poetry lock --no-update return:

Because langchain-openai (0.1.8) depends on tiktoken (>=0.7,<1)
 and no versions of langchain-openai match >0.1.8, langchain-openai (>=0.1.8) requires tiktoken (>=0.7,<1).
So, because langchain-community depends on both tiktoken (>=0.3.2,<0.6.0) and langchain-openai (>=0.1.8), version solving failed.

Do you think I should keep the original import?

eyurtsev · 2024-06-11T14:07:03Z

Hello @eyurtsev

In test_embeddings_filter.py, I change from langchain_community.embeddings import OpenAIEmbeddings to from langchain_openai.embeddings import OpenAIEmbeddings

But, for that, I need to add a dependency in pyproject.tml
[tool.poetry.group.test_integration.dependencies]
...
langchain-openai = ">=0.1.8"
But, the poetry lock --no-update return:
Because langchain-openai (0.1.8) depends on tiktoken (>=0.7,<1)
 and no versions of langchain-openai match >0.1.8, langchain-openai (>=0.1.8) requires tiktoken (>=0.7,<1).
So, because langchain-community depends on both tiktoken (>=0.3.2,<0.6.0) and langchain-openai (>=0.1.8), version solving failed.
Do you think I should keep the original import?

Yes, please keep the original import.

If you want to spend more effort, you could fix the test logic itself (not required for this PR) -- the test should not depend on the embedding provider, and instead can leverage fake embeddings in langchain_core. If that's done, then the test can be moved into unit tests.

pprados · 2024-06-12T11:26:05Z

@eyurtsev
"The Integration docs lint" checks is blocked in all the worlflow.

eyurtsev · 2024-06-11T14:02:12Z

libs/community/pyproject.toml

@@ -67,6 +67,7 @@ tiktoken = ">=0.3.2,<0.6.0"
 anthropic = "^0.3.11"
 langchain-core = { path = "../core", develop = true }
 langchain = { path = "../langchain", develop = true }
+#langchain-openai = ">=0.1.8"


Could you revert changes in pyproject toml and poetry lock please? Most PRs should not be modifying these files

.../community/tests/integration_tests/retrievers/document_compressors/test_embeddings_filter.py

libs/community/tests/integration_tests/retrievers/test_contextual_compression.py

Add native async implementation for EmbeddingsFilter

Make EmbeddingsFilters async

d9f4eff

pprados force-pushed the pprados/fix_embeddings_filter branch from 7bd7f32 to d9f4eff Compare June 10, 2024 14:14

pprados marked this pull request as ready for review June 10, 2024 14:20

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. Ɑ: embeddings Related to text embedding models module 🤖:improvement Medium size change to existing code to handle new use-cases labels Jun 10, 2024

eyurtsev reviewed Jun 10, 2024

View reviewed changes

Fix test

e7428d8

eyurtsev self-assigned this Jun 10, 2024

pprados added 7 commits June 10, 2024 17:19

Fix poetry

c82b2ff

Fix

ba40a93

Merge branch 'master' into pprados/fix_embeddings_filter

351f689

Fix openai dependencies

f9d7419

Merge branch 'master' into pprados/fix_embeddings_filter

6dc472e

Fix openai dependencies

0be391b

Merge branch 'master' into pprados/fix_embeddings_filter

e1c664f

pprados changed the title ~~Make EmbeddingsFilters async~~ langchain[minor]: Make EmbeddingsFilters async Jun 11, 2024

pprados added 2 commits June 11, 2024 09:08

Merge remote-tracking branch 'upstream/master' into pprados/fix_embed…

af5d4c3

…dings_filter

Fix

014aa8e

pprados marked this pull request as draft June 11, 2024 07:16

pprados added 2 commits June 11, 2024 09:27

Fix openai dependency

62ecc17

format

ee9e388

pprados marked this pull request as ready for review June 11, 2024 07:37

pprados added 3 commits June 12, 2024 09:23

format

3bbbcf0

format

bcfff13

format

1c94bde

pprados added 2 commits June 12, 2024 09:30

Merge branch 'master' into pprados/fix_embeddings_filter

0914403

format

830403f

eyurtsev approved these changes Jun 12, 2024

View reviewed changes

dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Jun 12, 2024

eyurtsev merged commit 23c22fc into langchain-ai:master Jun 12, 2024
61 checks passed

pprados deleted the pprados/fix_embeddings_filter branch June 18, 2024 06:33

hinthornw pushed a commit that referenced this pull request Jun 20, 2024

langchain[minor]: Make EmbeddingsFilters async (#22737)

6aee08c

Add native async implementation for EmbeddingsFilter

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

langchain[minor]: Make EmbeddingsFilters async #22737

langchain[minor]: Make EmbeddingsFilters async #22737

pprados commented Jun 10, 2024

vercel bot commented Jun 10, 2024 •

edited

Loading

pprados commented Jun 10, 2024

eyurtsev left a comment

pprados commented Jun 11, 2024

eyurtsev commented Jun 11, 2024

pprados commented Jun 12, 2024

eyurtsev Jun 11, 2024

langchain[minor]: Make EmbeddingsFilters async #22737

langchain[minor]: Make EmbeddingsFilters async #22737

Conversation

pprados commented Jun 10, 2024

vercel bot commented Jun 10, 2024 • edited Loading

pprados commented Jun 10, 2024

eyurtsev left a comment

Choose a reason for hiding this comment

pprados commented Jun 11, 2024

eyurtsev commented Jun 11, 2024

pprados commented Jun 12, 2024

eyurtsev Jun 11, 2024

Choose a reason for hiding this comment

vercel bot commented Jun 10, 2024 •

edited

Loading