fix(weaviate): support v4.6.3 #1134

tibor-reiss · 2024-05-24T13:59:48Z

Related to #1123

Bump weaviate from 3.26.2 to 4.6.3
Update instrumentation to be compatible with v4
Update the examples
Keep v3-specific code so that 3.26.2 still works

nirga · 2024-05-24T14:10:03Z

Thanks @tibor-reiss, did you try it to make sure it works with the new version?

tibor-reiss · 2024-05-24T14:14:37Z

Hi @nirga, yes, 9 passed, 1 skipped.
Took me a while to figure out that it was just a bump in the version in __init__ - but this was a good learning curve :)

Update: are there some other tests which need to be done apart from the unit tests?

nirga · 2024-05-24T14:17:04Z

@tibor-reiss can you run one of the sample apps that uses weaviate and make sure that you see traces, and add a screenshot here? I vaguely remember that there was a reason why @paolorechia who wrote this instrumentation didn't enable it for v4

tibor-reiss · 2024-05-24T21:35:34Z

@nirga Took me some time to set up a fully functioning environment locally - consequently, I have updated the sample apps to accommodate for local work. I adjusted v4 so it resembles v3.

Could you please check that it works with openai as well? The free tier was removed, so I tested with cohere.

It seems weaviate_v3.py still runs with weaviate-v4, but it will be deprecated at some point.

Screenshot attached from traceloop.

nirga · 2024-05-24T22:01:00Z

@tibor-reiss actually no :/ the GET / DELETE spans you're seeing are instrumented by otel's URLLIB3 that we're installing :/ Looks like the weaviate spans aren't there. You can easily see it by disabling all instrumentations except for weaviate, like this:

from traceloop.sdk.instruments import Instruments

Traceloop.init(instruments={Instruments.WEAVIATE})

From that, I'm guessing that if we update the test files to use the v4 syntax they will also start failing

tibor-reiss · 2024-05-26T11:07:23Z

Thanks for your patience @nirga! Below the updated screenshot. The weaviate api changed quite heavily. Combining their official documentation with the already present instrumentation, now there are references to "private" classes/methods. It works, but it means that this will probably change in the future. Let me know what you think.
Additionally, there might be other interesting methods to instrument - let me know.

I bumped the version just for this instrumentation to 0.20.0 since it's quite a big change.

Could you please check that the examples (weaviate_v4.py and weaviate_v3.py) work with the openai "backend" as well?

nirga · 2024-05-26T12:51:21Z

Thanks @tibor-reiss! Don't update version numbers, they get bumped automatically when we release 😃

So now it works both with old and new versions of weaviate? Or did we drop support for old versions?

tibor-reiss · 2024-05-26T18:41:54Z

@nirga Oh, sorry, I did not think that you would like to keep support for v3 - installing 0.19.0 with weaviate==3.26.2 would have still worked.

Anyway, I have put it back - with minor changes:

_GraphQLInstrumentor is the only overlap
marked the v3 so hopefully it can be easily deleted in the future
moved the previous tests to *_v3.py

So now both versions work - tested both the weaviate_v3.py and weaviate_v4.py.

Could you please add the cassettes for test_weaviate_instrumentation? I have generated them but they are with localhost since I am running weaviate via docker.

nirga · 2024-05-26T20:45:04Z

@tibor-reiss the recording of the cassettes don't work for you?

tibor-reiss · 2024-05-27T07:08:01Z

Good morning @nirga,

I can generate them - it just has localhost instead of traceloop :)

However, I have noticed that one test fails in test_weaviate_instrumentation.py with vcr enabled. In the old tests (now renamed to test_weaviate_instrumentation_v3.py), there was test_weaviate_create_batch - marked with "Flaky test" - maybe this was due to similar reason?

Example from test_weaviate_instrumentation.py: test_weaviate_query_fetch_object_by_id

Store the data from query_fetch_object_by_id() in a variable: data = query_fetch_object_by_id(client, uuid_value)
Add assert data.properties.get("author") == "Robert" -> this fails

It seems to me that vcr does not fetch all details, e.g. total_count (from test_weaviate_query_fetch_objects) is also not present in the response. So the test_weaviate_query_fetch_objects test is successful because the database is still running. Given that the db is still required to pass the tests, i.e. there are inserts/deletes happening, does vcr actually speed up them? Additionally, now that weaviate-v4 has a local option (e.g. with docker container - how I did the testing), is vcr needed? Let me know your thoughts please.

nirga · 2024-05-27T07:45:58Z

Hey @tibor-reis! What VCR does is just recording and replaying of HTTP requests, so as long as the database is sending back responses this should work. I can try and look into this as well today.

During CI, I think spinning off a local weaviate might be more cumbersome than recording and replaying HTTP responses but if you think it's easier and stable enough - let's do it. Can you assist in fixing the CI yaml though? (it's under the .github folder)

tibor-reiss · 2024-05-27T10:30:07Z

The reason for some test failures is that grpc is built into weaviate-v4, but this is not supported in vcrpy. As a solution for these tests, I have removed the vcr markers, and added a command line flag (with_grpc), i.e. the tests which use grpc (e.g. fetching) need a running weaviate instance.

nirga

Thanks @tibor-reiss! This looks overall good, 2 small comments:

Can you fix the lint issues?
Can you remove the cassettes that are no longer used?

nirga · 2024-05-27T14:45:27Z

packages/sample-app/sample_app/weaviate_v3.py

@@ -1,40 +1,37 @@
 # Tested with weaviate-client==3.26.0
+# Weaviate instrumentation with opentelemetry-instrumentation-weaviate==0.19.0


this can be removed, right? cause we support both

Are there any cassettes left which are not used? I removed both "fetch" cassettes, the rest is needed afais.

nirga changed the title ~~chore(deps-dev): bump weaviate-client from 3.26.2 to 4.6.3 in /packages/opentelemetry-instrumentation-weaviate~~ fix(weaviate): support v4.6.3 May 24, 2024

tibor-reiss force-pushed the bump-weaviate branch from 9acd429 to 8672dbb Compare May 26, 2024 11:04

tibor-reiss force-pushed the bump-weaviate branch from 8672dbb to 9b3776c Compare May 26, 2024 18:32

tibor-reiss force-pushed the bump-weaviate branch from edb8ef0 to 4f985a3 Compare May 26, 2024 19:06

tibor-reiss added 10 commits May 27, 2024 15:53

Bump weaviate

8ff7a53

Update due to api change

48f780b

Update examples

7285078

Keep v3 for backwards compatibility

6c8ad9b

Rename cassettes; update gql test

d946343

Fixed uuid so that cassettes can work

cd656fd

Add cassettes

08d2ec5

Run linters

3f4ab59

Update cassettes and remove vcr from fetch tests due to grpc

db7d341

Mark tests which need a running weaviate instance due to grpc

64df7f3

tibor-reiss force-pushed the bump-weaviate branch from cac8137 to 64df7f3 Compare May 27, 2024 13:53

nirga reviewed May 27, 2024

View reviewed changes

tibor-reiss and others added 2 commits May 27, 2024 18:56

Input from code review

8038841

Merge branch 'main' into bump-weaviate

698c5e5

nirga approved these changes May 28, 2024

View reviewed changes

nirga merged commit 9103977 into traceloop:main May 28, 2024
8 checks passed

tibor-reiss deleted the bump-weaviate branch July 5, 2024 04:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(weaviate): support v4.6.3 #1134

fix(weaviate): support v4.6.3 #1134

tibor-reiss commented May 24, 2024 •

edited

Loading

nirga commented May 24, 2024

tibor-reiss commented May 24, 2024 •

edited

Loading

nirga commented May 24, 2024 •

edited

Loading

tibor-reiss commented May 24, 2024 •

edited

Loading

nirga commented May 24, 2024

tibor-reiss commented May 26, 2024 •

edited

Loading

nirga commented May 26, 2024

tibor-reiss commented May 26, 2024 •

edited

Loading

nirga commented May 26, 2024

tibor-reiss commented May 27, 2024

nirga commented May 27, 2024

tibor-reiss commented May 27, 2024 •

edited

Loading

nirga left a comment

nirga May 27, 2024

tibor-reiss May 27, 2024 •

edited

Loading

		@@ -1,40 +1,37 @@
		# Tested with weaviate-client==3.26.0
		# Weaviate instrumentation with opentelemetry-instrumentation-weaviate==0.19.0

fix(weaviate): support v4.6.3 #1134

fix(weaviate): support v4.6.3 #1134

Conversation

tibor-reiss commented May 24, 2024 • edited Loading

nirga commented May 24, 2024

tibor-reiss commented May 24, 2024 • edited Loading

nirga commented May 24, 2024 • edited Loading

tibor-reiss commented May 24, 2024 • edited Loading

nirga commented May 24, 2024

tibor-reiss commented May 26, 2024 • edited Loading

nirga commented May 26, 2024

tibor-reiss commented May 26, 2024 • edited Loading

nirga commented May 26, 2024

tibor-reiss commented May 27, 2024

nirga commented May 27, 2024

tibor-reiss commented May 27, 2024 • edited Loading

nirga left a comment

Choose a reason for hiding this comment

nirga May 27, 2024

Choose a reason for hiding this comment

tibor-reiss May 27, 2024 • edited Loading

Choose a reason for hiding this comment

tibor-reiss commented May 24, 2024 •

edited

Loading

tibor-reiss commented May 24, 2024 •

edited

Loading

nirga commented May 24, 2024 •

edited

Loading

tibor-reiss commented May 24, 2024 •

edited

Loading

tibor-reiss commented May 26, 2024 •

edited

Loading

tibor-reiss commented May 26, 2024 •

edited

Loading

tibor-reiss commented May 27, 2024 •

edited

Loading

tibor-reiss May 27, 2024 •

edited

Loading