Add alpaca reverse augmentation possibility #2342
Conversation
❌ pre-commit failed.
Eval dataset is still missing
One thing that is not quite clear to me is whether we should add the reverse augmentation samples to the train set in addition to the original ones.
Yes, I think forward + reverse could both be part of the training set. Ideally the same examples of the Alpaca set should be used for evaluating both forward and reverse (in order to avoid leaking the other dataset's eval data into the train set).
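The split-before-augment idea discussed above can be sketched roughly as follows. This is an illustrative standalone sketch, not the PR's actual code: the function name `split_then_augment`, the 90/10 split, and the `(instruction, response)` tuple representation are all assumptions for the example.

```python
import random


def reverse(sample):
    # Swap instruction and response to create a "reverse" training sample.
    instruction, response = sample
    return (response, instruction)


def split_then_augment(samples, eval_fraction=0.1, keep_unreversed=True, seed=42):
    # Split FIRST, so eval examples (and their reversals) never leak
    # into the training set.
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n_eval = int(len(shuffled) * eval_fraction)
    eval_set, train_set = shuffled[:n_eval], shuffled[n_eval:]

    # Augment each split independently: the same examples are used for
    # both the forward and the reverse variant of a split.
    def augment(split):
        reversed_split = [reverse(s) for s in split]
        return split + reversed_split if keep_unreversed else reversed_split

    return augment(train_set), augment(eval_set)
```

With `keep_unreversed=True` the train set contains forward + reverse copies of the train examples only, and the eval set contains forward + reverse copies of the held-out examples, so neither direction of an eval example appears in training.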
For the smaller models it is necessary to specify
Added the eval set and also the option to keep the unreversed data in the training set. |
Very nice, thank you. IMO the now-empty AlpacaDataset & CodeAlpacaDataset could be removed.
super().__init__()
self.data = data
if mode not in ["sft", "rl"]:
could be tuple instead of list
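For context, the suggestion is to hold the fixed set of valid modes in an immutable tuple rather than a list. A minimal sketch of that pattern (the name `VALID_MODES` and the error message are illustrative, not the PR's code):

```python
VALID_MODES = ("sft", "rl")  # tuple: immutable, signals a fixed set of options


def check_mode(mode: str) -> str:
    # Membership tests work the same on tuples as on lists,
    # but the tuple cannot be mutated accidentally.
    if mode not in VALID_MODES:
        raise ValueError(f"Unknown dataset mode {mode!r}, expected one of {VALID_MODES}")
    return mode
```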
done
class Alpaca(AlpacaBase):
    def __init__(self, mode: str = "sft", cache_dir: str = None) -> None:
        super().__init__(dataset_name="yahma/alpaca-cleaned", mode=mode, cache_dir=cache_dir)
class AlpacaDataset(AlpacaBaseDataset):
If these classes don't add any functionality they could be removed (i.e. base class be used directly).
done
manual_seed: int = 287631038922,
reverse_augmentation: bool = False,
keep_unreversed: bool = True,
) -> tuple[AlpacaDataset, AlpacaDataset] | tuple[CodeAlpacaDataset, CodeAlpacaDataset]:
base class type annotation for factory function would be better imo
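A rough sketch of what annotating the factory with the base class looks like. The class names mirror those in the diff, but the bodies and the `load_alpaca_dataset` factory are placeholders invented for the example:

```python
class AlpacaBaseDataset:
    def __init__(self, dataset_name: str) -> None:
        self.dataset_name = dataset_name


class AlpacaDataset(AlpacaBaseDataset):
    pass


class CodeAlpacaDataset(AlpacaBaseDataset):
    pass


def load_alpaca_dataset(name: str) -> tuple[AlpacaBaseDataset, AlpacaBaseDataset]:
    # Return (train, eval). The annotation promises only the base class,
    # so callers don't depend on which concrete subclass comes back and
    # the union of every subclass pair is avoided.
    cls = CodeAlpacaDataset if "code" in name else AlpacaDataset
    return cls(name), cls(name)
```

Compared with `tuple[AlpacaDataset, AlpacaDataset] | tuple[CodeAlpacaDataset, CodeAlpacaDataset]`, the base-class annotation also keeps the signature stable if another subclass is added later.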
done
lgtm!
closes #2335
Alpaca with reverse augmentation can be run with:
I couldn't run this due to gradient unscaling errors, so before this gets merged we should do a run to check whether this reversal really improves the loss.