This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

[FiD] Add specialized chunking to search engine retrievers #4227

Merged
merged 4 commits into from Dec 8, 2021

klshuster
Contributor

Patch description
This PR replicates the functionality of the woi_chunk_retrieved_docs mutator inside the search engine retriever.
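
For readers unfamiliar with the chunking step, here is a rough sketch of the fixed-size windowing that the diff excerpt in this PR iterates over (`range(0, len(tokens), self.len_chunk)`). It assumes plain whitespace tokenization, and `chunk_tokens` is a hypothetical helper used only for illustration, not the retriever's actual method.

```python
from typing import List


def chunk_tokens(text: str, len_chunk: int) -> List[str]:
    """Split text into consecutive chunks of at most len_chunk tokens.

    Assumption: tokens are whitespace-separated words. The real
    retriever may tokenize differently.
    """
    tokens = text.split()
    return [
        ' '.join(tokens[i : i + len_chunk])
        for i in range(0, len(tokens), len_chunk)
    ]
```

The last chunk may be shorter than `len_chunk` when the token count is not an exact multiple of it.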

Testing steps
Added CI

for i in range(0, len(tokens), self.len_chunk)
]
return self.chunk_reranker.get_top_chunks(query, doc_chunks)
if self.chunk_reranker != 'woi_chunk_retrieved_docs':
Contributor

Shouldn't this be the chunk_ranker_type? I think you are comparing the string to the ChunkRanker object here.

Contributor Author

Oh yes, you're right.
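
For illustration, a minimal sketch of the issue raised in this thread: comparing a ranker object to a string with `!=` is always true, so the branch never depends on the configured type. The `ChunkRanker` and `Retriever` classes below are hypothetical stand-ins, not the actual ParlAI classes; only the attribute names `chunk_reranker` and `chunk_ranker_type` come from the discussion above.

```python
class ChunkRanker:
    """Hypothetical stand-in for the chunk ranker object."""

    def get_top_chunks(self, query, doc_chunks):
        return doc_chunks[:1]


class Retriever:
    """Hypothetical retriever holding both the type string and the object."""

    def __init__(self, chunk_ranker_type: str):
        self.chunk_ranker_type = chunk_ranker_type
        self.chunk_reranker = ChunkRanker()

    def takes_branch_buggy(self) -> bool:
        # Compares an object to a string: always True for a plain object.
        return self.chunk_reranker != 'woi_chunk_retrieved_docs'

    def takes_branch_fixed(self) -> bool:
        # Compares the configured type string, as suggested in the review.
        return self.chunk_ranker_type != 'woi_chunk_retrieved_docs'
```

With `chunk_ranker_type='woi_chunk_retrieved_docs'`, the buggy check still takes the branch, while the string comparison does not.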

Contributor

@mojtaba-komeili mojtaba-komeili left a comment

Other than the comment I left, the rest LGTM.

@klshuster klshuster merged commit 6012f84 into main Dec 8, 2021
@klshuster klshuster deleted the woi_chunking branch December 8, 2021 19:49

3 participants