Script for data preparation from CSVs #329

Closed
aniketmaurya opened this issue Jul 28, 2023 · 3 comments · Fixed by #462
Labels
enhancement New feature or request fine-tuning

Comments

@aniketmaurya
Contributor

aniketmaurya commented Jul 28, 2023

Example:

| Instruction | Input | Output |
| --- | --- | --- |
| What is Llama | Answer based on LLMs | Llama is a foundation model from Meta AI |

python scripts/prepare_csv.py --csv_path data.csv
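
To make the request concrete, here is a minimal sketch of the CSV-reading step such a script might start with. It is an illustration only, not the code merged for this issue: the function name `load_csv_records` and the lowercase output keys are assumptions, and only the three column names come from the example above.

```python
import csv
import io

# Column names taken from the example table in this issue.
REQUIRED_COLUMNS = ("Instruction", "Input", "Output")


def load_csv_records(csv_file):
    """Read an open CSV file into a list of instruction/input/output dicts.

    Hypothetical helper: the name and the lowercase keys are assumptions.
    """
    reader = csv.DictReader(csv_file)
    fieldnames = reader.fieldnames or []
    missing = [col for col in REQUIRED_COLUMNS if col not in fieldnames]
    if missing:
        raise ValueError(f"CSV is missing required columns: {missing}")
    return [
        {
            # Empty or absent cells become empty strings.
            "instruction": (row["Instruction"] or "").strip(),
            "input": (row["Input"] or "").strip(),
            "output": (row["Output"] or "").strip(),
        }
        for row in reader
    ]


# In-memory stand-in for data.csv, using the example row from this issue.
sample = io.StringIO(
    "Instruction,Input,Output\n"
    "What is Llama,Answer based on LLMs,Llama is a foundation model from Meta AI\n"
)
records = load_csv_records(sample)
print(records)
```

For a file on disk, the same helper could be called with `open("data.csv", newline="", encoding="utf-8")`.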
carmocca added the enhancement and fine-tuning labels on Aug 14, 2023
@Anindyadeep
Contributor

Anindyadeep commented Aug 24, 2023

@aniketmaurya, hey, if this issue is still open, I would like to contribute. I have a question: after running the script, what kind of output is expected (a .jsonl file saved to a user-provided path, with an automatic train/test split)?

@aniketmaurya
Contributor Author

Hi @Anindyadeep, the output format should be similar to the prepare_alpaca.py script.
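
The thread does not spell out what "similar" covers. If it is read as including a reproducible train/test split over the loaded records (an assumption), that step could be sketched as follows. The function name `split_records` and the `test_fraction` and `seed` parameters are hypothetical, and any tokenization or on-disk format is left out because it is not described here.

```python
import random


def split_records(records, test_fraction=0.1, seed=42):
    """Shuffle records with a fixed seed and split them into (train, test).

    Hypothetical helper: the name, parameters, and defaults are assumptions.
    """
    shuffled = list(records)  # copy so the caller's list is not reordered
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Toy records in the instruction/input/output shape from the example above.
records = [
    {"instruction": f"question {i}", "input": "", "output": f"answer {i}"}
    for i in range(10)
]
train, test = split_records(records, test_fraction=0.2)
print(len(train), len(test))
```

Fixing the seed keeps the split stable across runs, so a fine-tuning experiment can be repeated on the same held-out rows.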

@Anindyadeep
Contributor

Anindyadeep commented Aug 24, 2023

@aniketmaurya I opened PR #462 for this. Let me know if I need to make any more changes.
