added offsetbias execute prompt and judgement process code #159

sanghyuk-choi · 2024-07-25T12:47:53Z

I've just added prompt and judgement code for NCSOFT/Llama-3-OffsetBias-8B.

natolambert · 2024-07-25T15:46:43Z

@sanghyuk-choi need some changes to support this model. E.g. which version of VLLM do we need? Seems like we at least need that (could not run this on my previous image).

EDIT: This may be a different issue in run_generative, checking.

natolambert · 2024-07-25T15:55:39Z

@sanghyuk-choi I made changes directly here: sanghyuk-choi#1

natolambert · 2024-07-25T16:19:03Z

Scores are live, so we should be able to merge this soon.

cleaning / finishing pr

…bench into offsetbias

sanghyuk-choi · 2024-07-26T02:23:18Z

also I have fixed my mistake by adding model_modifier when calling process_judgement in run_judge_pair function.

natolambert · 2024-07-26T02:49:09Z

Nice, yeah @sanghyuk-choi this looks good. Sorry for lumping a bunch of small fixes into your code, I got a little carried away. merging as long as checks pass.

added offsetbias execute prompt and judgement process code

494c17f

sanghyuk-choi mentioned this pull request Jul 25, 2024

Add new Reward Model and Generative Model. #158

Closed

natolambert linked an issue Jul 25, 2024 that may be closed by this pull request

Add new Reward Model and Generative Model. #158

Closed

cleaning / finishing pr

4ab465b

sanghyuk-choi and others added 3 commits July 26, 2024 11:07

Merge pull request #1 from allenai/offsetbias

5d5e5c6

cleaning / finishing pr

fixed bugs on run_judge_pair by adding model_modifier arg

0bd91ce

Merge branch 'offsetbias' of https://github.com/sanghyuk-choi/reward-…

033defd

…bench into offsetbias

natolambert merged commit bc72fb2 into allenai:main Jul 26, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added offsetbias execute prompt and judgement process code #159

added offsetbias execute prompt and judgement process code #159

sanghyuk-choi commented Jul 25, 2024

natolambert commented Jul 25, 2024 •

edited

Loading

natolambert commented Jul 25, 2024 •

edited

Loading

natolambert commented Jul 25, 2024

sanghyuk-choi commented Jul 26, 2024

natolambert commented Jul 26, 2024

added offsetbias execute prompt and judgement process code #159

added offsetbias execute prompt and judgement process code #159

Conversation

sanghyuk-choi commented Jul 25, 2024

natolambert commented Jul 25, 2024 • edited Loading

natolambert commented Jul 25, 2024 • edited Loading

natolambert commented Jul 25, 2024

sanghyuk-choi commented Jul 26, 2024

natolambert commented Jul 26, 2024

natolambert commented Jul 25, 2024 •

edited

Loading

natolambert commented Jul 25, 2024 •

edited

Loading