-
Notifications
You must be signed in to change notification settings - Fork 233
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
docs: make document indices self-contained #1678
Conversation
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Codecov ReportPatch coverage has no change and project coverage change:
Additional details and impacted files@@ Coverage Diff @@
## main #1678 +/- ##
===========================================
- Coverage 84.46% 30.06% -54.40%
===========================================
Files 134 133 -1
Lines 8838 8731 -107
===========================================
- Hits 7465 2625 -4840
- Misses 1373 6106 +4733
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ensure:
- Consistent use of document/Document
- H1 headings are title case, H2 and lower are sentence case
|
||
### 1.2.1. WCS (managed instance) | ||
**WCS (managed instance)** |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should these be h4 headings?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes but the table of contets got too verbose
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think there is a small issue with the way things are currently set up.
The pages on HNSWLib and InMemory seem to be fully self-contained, giving general information like the following:
But this (and some other backend agnostic information) seems to be missing from the pages describing other backends (weavaite, qdrant, etc).
If we want thins to be truly self-contained, then we should just copy that sort of information to all the backends imo.
But I like the direction this is going into!!
To combine these operations into a single, hybrid search query, you can use the query builder that is accessible | ||
through [build_query()][docarray.index.abstract.BaseDocIndex.build_query]: | ||
|
||
### Initialize the Document Index and add data | ||
```python |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we add 1-2 sentences of explanation to these code snippets? i think they are very self-explanatory, but personally as a user i don't like code snippets without words around them :)
) | ||
retrieved_docs, scores = doc_index.execute_query(query) | ||
``` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we should here again add a big fat link to all the backend documentation pages and tell people that they can get more detailed information there
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Key points:
- Change all "top 10 most similar" to "ten most similar" throughout
- Wrap all methods in backticks. Some still lack them
- H2 headings and lower should be sentence case. Some are still title case
There are a few other items throughout the review too
Even better than backticks would be to link the the corresponding docstring / API docs :) |
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
already linked, they just lacked the backticks |
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Final nitpicks
Signed-off-by: jupyterjazz <saba.sturua@jina.ai>
This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size. |
📝 Docs are deployed on https://ft-docs-self-contained-indices--jina-docs.netlify.app 🎉 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
big win for people that are interested in a specific backend!
As stated in #1603, our documentation for doc indices is not consistent and lacks some of the functionalities.
Definition of Done:
High-level Template: