Added cross encoder re-ranker by yfulwani · Pull Request #3 · oneapi-src/vertical-search-engine

yfulwani · 2023-05-08T21:23:34Z

No description provided.

rbernalc · 2023-06-01T15:50:49Z

 ### Re-ranking

-In this reference kit, we focus on the document retrieval aspect of building a vertical search engine to obtain an initial list of the top-K most similar documents in the corpus for a given query.  Often times, this is sufficient for building a feature rich system.  However, in some situations, a 3rd component,  the re-ranker, which is not included in this reference kit, could be added to the search pipeline to improve results. In this architecture, for a given query, the *document retrieval* step will use one model to rapidly obtain a list of the top-K documents (as shown in this reference kit), followed by a *re-ranking* step which will use a different model to re-order the list of K retrieved documents before returning to the user.  The second re-ranking refinement step has been shown to improve user satisfaction, especially when fine-tuned on a custom corpus, but may be unnecessary as a starting point for building a functional vertical search engine.  To extend this reference implementation with re-ranking, we direct you to https://www.sbert.net/examples/applications/retrieve_rerank/README.html for further details on implementation where Intel® oneAPI optimizations can also be applied to speed up re-ranking models.
+In this reference kit, we focus on the document retrieval aspect of building a vertical search engine to obtain an initial list of the top-K most similar documents in the corpus for a given query.  Often times, this is sufficient for building a feature rich system.  However, in some situations, a 3rd component,  the re-ranker, could be added to the search pipeline to improve results. In this architecture, for a given query, the *document retrieval* step will use one model to rapidly obtain a list of the top-K documents, followed by a *re-ranking* step which will use a different model to re-order the list of K retrieved documents before returning to the user.  The second re-ranking refinement step has been shown to improve user satisfaction, especially when fine-tuned on a custom corpus, but may be unnecessary as a starting point for building a functional vertical search engine.  To know more about re-ranker, we direct you to https://www.sbert.net/examples/applications/retrieve_rerank/README.html for further details. In this reference kit we use `cross-encoder/ms-marco-MiniLM-L-6-v2` model as re-ranker. For more details about different re-ranker models visit https://www.sbert.net/docs/pretrained-models/ce-msmarco.html.


Thanks @yfulwani for this contribution but unfortunately we wouldn't accept PRs from your main branch. Please submit your PR from another branch in your forked repository and make sure your main branch remains identical with our main branch at all times. You are not supposed to manually push your main branch.

rbernalc · 2023-06-01T15:52:13Z

Hi @yfulwani If you need further assitance on the right process to submit PRs from a forked repository please let us know.

fixed typos

Added cross encoder re-ranker

9aa4f03

rbernalc suggested changes Jun 1, 2023

View reviewed changes

aagalleg pushed a commit that referenced this pull request Feb 16, 2024

Update README.md (#3)

b940a94

fixed typos

aagalleg deleted the branch oneapi-src:main February 16, 2024 20:05

aagalleg closed this Feb 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added cross encoder re-ranker#3

Added cross encoder re-ranker#3
yfulwani wants to merge 1 commit into
oneapi-src:mainfrom
yfulwani:main

yfulwani commented May 8, 2023

Uh oh!

rbernalc Jun 1, 2023

Uh oh!

rbernalc commented Jun 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

yfulwani commented May 8, 2023

Uh oh!

rbernalc Jun 1, 2023

Choose a reason for hiding this comment

Uh oh!

rbernalc commented Jun 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants