
[Chatllama] Merge the datasets to create more insightful training data #321

Open · 3 tasks
PierpaoloSorbellini opened this issue Mar 31, 2023 · 2 comments
Labels: chatllama (Issue related to the ChatLLaMA module), good first issue (Good for newcomers)

Comments

@PierpaoloSorbellini (Collaborator)

Description

Currently, the supported datasets can only be used as alternatives to one another.
It would be nice to add diversity to the training data by defining recipes that merge these datasets and produce more insightful training runs.

TODO

  • Define what parameters need to be specified to create a "recipe" for the dataset, and add them to the config files.
  • Expand the dataset class to allow parameters from the config file to generate the appropriate dataset mixture.
  • Evaluate the possible increase in model quality due to different "recipes" used.
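One way to sketch the first two TODO items: express a recipe as per-dataset sampling weights in the config file, and have the dataset class sample from each source proportionally. The sketch below is a hypothetical illustration, not the ChatLLaMA API — `DatasetMixture`, the config keys, and the dataset names are all assumptions.

```python
import random
from typing import Dict, List


class DatasetMixture:
    """Merge several datasets according to per-dataset weights (a "recipe").

    Hypothetical sketch: `recipe` maps dataset names to sampling weights,
    `sources` maps the same names to lists of training examples.
    """

    def __init__(self, sources: Dict[str, List[dict]],
                 recipe: Dict[str, float], seed: int = 42):
        # Keep only datasets with a positive weight in the recipe.
        self.sources = {name: data for name, data in sources.items()
                        if recipe.get(name, 0) > 0}
        # Normalize weights so they sum to 1.
        total = sum(recipe[name] for name in self.sources)
        self.weights = {name: recipe[name] / total for name in self.sources}
        self.rng = random.Random(seed)

    def sample(self, n: int) -> List[dict]:
        """Draw n examples, choosing the source dataset by recipe weight."""
        names = list(self.sources)
        probs = [self.weights[name] for name in names]
        out = []
        for _ in range(n):
            name = self.rng.choices(names, weights=probs, k=1)[0]
            out.append(self.rng.choice(self.sources[name]))
        return out


# A recipe like this could live in the YAML config (keys are hypothetical):
# dataset_recipe:
#   dataset_a: 0.5
#   dataset_b: 0.3
#   dataset_c: 0.2
sources = {
    "dataset_a": [{"text": f"a-{i}"} for i in range(100)],
    "dataset_b": [{"text": f"b-{i}"} for i in range(100)],
    "dataset_c": [{"text": f"c-{i}"} for i in range(100)],
}
recipe = {"dataset_a": 0.5, "dataset_b": 0.3, "dataset_c": 0.2}

mixture = DatasetMixture(sources, recipe)
batch = mixture.sample(10)
```

With weights read from the config, evaluating different "recipes" (the third TODO item) reduces to training runs that differ only in that one config section.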
@PierpaoloSorbellini PierpaoloSorbellini added good first issue Good for newcomers chatllama Issue related to the ChatLLaMA module labels Mar 31, 2023
@mohsinmahmood12

Hi there! I want to work on this issue.

@diegofiori (Collaborator)

Hello @mohsinmahmood12, thanks a lot for your interest in ChatLLaMA. I've assigned the issue to you! Let us know if you face any difficulties with the task 😄
