What format was used for the final fine-tuning of LLaMA2-7B in RAFT? #444

Leo4zhou · 2024-05-24T09:33:44Z

In RAFT, the jsonl file format after conversion using "format.py" is as follows:
{"messages":[{"content":"When was the University of California, Berkeley established?","role":"user"},{"content":"To answer the question, we need to find the information related to the establishment date of the University of California, Berkeley in the provided context. \n\nThe relevant information is found in the line ##begin_quote##"Established March 23, 1868[1]"##end_quote##. This line directly states the date when the University of California, Berkeley was established.\n\n: The University of California, Berkeley was established on March 23, 1868.","role":"assistant"}]}

But the file format used for fine-tuning in the paper is as follows, which centrally contains question, context, instruction and CoT Answer:

What format was used for the final fine-tuning of LLaMA2-7B in RAFT? Does it include context or other content?

ShishirPatil · 2024-06-05T21:59:19Z

Hey @Leo4zhou if you want an end to end example, you can check out this tutorial https://build.microsoft.com/en-US/sessions/78e4ef86-4443-4cb6-a0b4-157a6623fe87 and the code base is at aka.ms/raft-build24. Please reopen / start a new issue if you happen to have any follow-ups.

ShishirPatil closed this as completed Jun 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What format was used for the final fine-tuning of LLaMA2-7B in RAFT? #444

What format was used for the final fine-tuning of LLaMA2-7B in RAFT? #444

Leo4zhou commented May 24, 2024

ShishirPatil commented Jun 5, 2024

What format was used for the final fine-tuning of LLaMA2-7B in RAFT? #444

What format was used for the final fine-tuning of LLaMA2-7B in RAFT? #444

Comments

Leo4zhou commented May 24, 2024

ShishirPatil commented Jun 5, 2024