RankLLM Revamp - VLLM support, APEER prompt, fixed caching, improved documentation #119
Conversation
src/rank_llm/scripts/run_rank_llm.py (Outdated)

@@ -153,5 +155,10 @@ def main(args):
        default="You are RankLLM, an intelligent assistant that can rank passages based on their relevancy to the query.",
        help="the system message used in prompts",
    )
    parser.add_argument(
        "--batched",
This batch flag can be confused with rerank_batch: the latter reranks multiple queries regardless of whether they are processed one at a time or all at once, while the former specifically means reranking multiple queries at once. From the default param values it looks like we want to support the rerank_batch function with batched=False? If so, I think the two different meanings of "batch" could be confusing; maybe we should rename one of the two use cases? WDYT @ronakice, @lintool
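To make the distinction concrete, here is a hypothetical sketch of the two senses of "batch" under discussion (signatures are illustrative only, not the actual RankLLM API):

    # Hypothetical signatures illustrating the two "batch" senses.
    def rerank(request):
        # Rerank candidates for a single query.
        ...

    def rerank_batch(requests, batched=False):
        # Rerank candidates for many queries. Even with batched=False,
        # each query may still go through inference one at a time;
        # batched=True would process multiple queries at once.
        ...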
Hmm, I agree this could be a source of confusion for users. Batching usually means multiple queries being processed at once, correct? In that case, does rerank_batch need renaming?
In my understanding, batch processing is async, and yes, it normally means multiple queries at once, but it doesn't restrict the batch size to be strictly greater than 1. IIRC, @lintool chose rerank and rerank_batch to align with the Pyserini function names (retrieve and retrieve_batch).
@ronakice WDYT about changing --batched to --vllm-batched?
I also propose having an --inference-method enum (or some similar name) with FastChat as the default value. The user can set it to vllm if they want vLLM rather than FastChat inference.
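A minimal argparse sketch of what these two proposals might look like (the flag names follow the suggestions above, but the exact choices and defaults are assumptions, not RankLLM's actual CLI):

    import argparse

    parser = argparse.ArgumentParser()
    # Proposed rename of --batched: scope the flag to vLLM explicitly.
    parser.add_argument(
        "--vllm-batched",
        action="store_true",
        help="rerank multiple queries at once using vLLM inference",
    )
    # Proposed enum; FastChat remains the default (values are hypothetical).
    parser.add_argument(
        "--inference-method",
        type=str,
        choices=["fastchat", "vllm"],
        default="fastchat",
        help="backend used for LLM inference",
    )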
I can add these two, yes 👍🏼 I agree that for the time being this is clearer.
@sahel-sh would we even need to mention --inference-method? --vllm-batched is the only vLLM setting; otherwise it defaults to FastChat.
Less than ideal, since "batch" in rerank_batch has a different implication than "batch" in vllm-batched, but it is OK for now.
import json
import os


def convert_json_to_jsonl(input_file, output_file):
Why is this needed?
It converts from the unsupported JSON request format to the newer JSONL request format.
Got it, thanks. In the high-level comment above, please mention which data class the new format is coming from.
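For context, a minimal sketch of such a conversion, assuming the legacy file holds a single JSON array of request objects (the actual field layout comes from RankLLM's request data class):

    import json

    def convert_json_to_jsonl(input_file, output_file):
        # Assumption: the legacy file contains one JSON array of
        # request objects. Write one object per line (JSONL).
        with open(input_file, "r") as fin:
            requests = json.load(fin)
        with open(output_file, "w") as fout:
            for request in requests:
                fout.write(json.dumps(request) + "\n")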
Pull Request Checklist
Reference Issue
Please provide a reference to the issue this PR addresses (# followed by the issue number). If there is no associated issue, write "N/A".
ref:
Checklist Items
Before submitting your pull request, please review these items:
PR Type
What kind of change does this PR introduce?