
Sequence to Sequence Problem Type #308

Merged
merged 20 commits into main from psi/seq2seq
Aug 4, 2023

Conversation

psinger
Collaborator

@psinger psinger commented Jul 24, 2023

Closes #100

Adds a new problem type: Sequence to Sequence modeling. For now, tested with different T5 models.
We can add more functionality step by step if this turns out to be frequently used.

I tried to reuse as much existing code as possible to avoid duplication.

@maxjeblick please take an initial look; it might need 1-2 iterations.

Todo:

  • Rework dataset?
  • Check padding
  • Check merging / pipeline
  • Default settings for dtype etc?
  • Model card
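As a sketch of what the "Check padding" item involves for an encoder-decoder problem type: labels and encoder inputs are typically padded separately, with padded label positions masked so the loss ignores them. The function and field names below are illustrative assumptions, not the actual h2o-llmstudio implementation.

```python
# Hypothetical collate function for a seq2seq batch. For encoder-decoder
# models such as T5, padded label positions are conventionally set to -100
# so that cross-entropy loss ignores them.

def collate_seq2seq(batch, pad_token_id, label_pad_id=-100):
    """Right-pad input ids and labels to the longest sequence in the batch."""
    max_in = max(len(x["input_ids"]) for x in batch)
    max_lab = max(len(x["labels"]) for x in batch)

    input_ids, attention_mask, labels = [], [], []
    for x in batch:
        n_in, n_lab = len(x["input_ids"]), len(x["labels"])
        # pad encoder inputs with the tokenizer's pad token
        input_ids.append(x["input_ids"] + [pad_token_id] * (max_in - n_in))
        # mask out padded positions for attention
        attention_mask.append([1] * n_in + [0] * (max_in - n_in))
        # pad labels with -100 so the loss skips them
        labels.append(x["labels"] + [label_pad_id] * (max_lab - n_lab))
    return {"input_ids": input_ids, "attention_mask": attention_mask, "labels": labels}
```

A library solution such as `transformers.DataCollatorForSeq2Seq` covers the same ground; the sketch only makes the masking convention explicit.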

@psinger psinger requested a review from maxjeblick July 25, 2023 12:49
@psinger psinger marked this pull request as ready for review July 25, 2023 12:54
Contributor

@maxjeblick maxjeblick left a comment


Thanks a lot for adding this problem type, it looks to be in a very good state!

I would move prepare_lora and generate to separate functions to avoid code duplication (they will probably be needed for other potential problem types as well).
Apart from that, the code looks fine; I will test it more over the next days.
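The suggested refactor could look roughly like the sketch below: prepare_lora becomes a standalone helper taking the config and backbone, so both causal LM and seq2seq model classes can share it. The cfg field names and the peft usage here are illustrative assumptions, not the repository's exact code.

```python
# Hypothetical shared helper: wrap a backbone with LoRA adapters when the
# config enables it. Config attribute names are assumptions for illustration.

def prepare_lora(cfg, backbone):
    """Return the backbone, LoRA-wrapped if cfg.training.lora is set."""
    if not getattr(cfg.training, "lora", False):
        return backbone

    # deferred import so the helper has no hard peft dependency when LoRA is off
    from peft import LoraConfig, get_peft_model

    lora_config = LoraConfig(
        r=cfg.training.lora_r,
        lora_alpha=cfg.training.lora_alpha,
        lora_dropout=cfg.training.lora_dropout,
        target_modules=cfg.training.lora_target_modules,
    )
    return get_peft_model(backbone, lora_config)
```

Each problem type's model class would then call `self.backbone = prepare_lora(cfg, self.backbone)` instead of duplicating the wrapping logic.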

@psinger
Collaborator Author

psinger commented Jul 28, 2023

Thanks for the comments @maxjeblick - I tried to address them.

Contributor

@maxjeblick maxjeblick left a comment


Thanks for the changes! Some observations:

  • The Summary tab fails with AttributeError: 'ConfigProblemBase' object has no attribute 'hf'.
  • Train Data Insights currently shows decoded input ids, whereas the model uses prompt input ids. Maybe it is useful in general to show prompt input ids instead of input ids? This could potentially also be changed in Max/insights table view #301.
  • Mid-term, promoting generate and prepare_lora to dedicated functions (with corresponding arguments: cfg, backbone, ...) may be useful. We can leave it here as is and I can have a look in [CODE IMPROVEMENT] Promote RLHF as a separate problem type #317.
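For context on the first bullet: the Summary tab apparently reads cfg.hf, so the new problem type's config needs that sub-config as well. A minimal sketch of the shape involved, with illustrative (not actual h2o-llmstudio) class and field names:

```python
# Hypothetical illustration of the missing attribute: the new problem type's
# ConfigProblemBase needs an `hf` sub-config for the Summary tab to read.
# Field names are assumptions for illustration only.
from dataclasses import dataclass, field


@dataclass
class ConfigHF:
    trust_remote_code: bool = True


@dataclass
class ConfigProblemBase:
    experiment_name: str = "seq2seq-demo"
    # default_factory is required for a mutable dataclass default
    hf: ConfigHF = field(default_factory=ConfigHF)


cfg = ConfigProblemBase()
assert hasattr(cfg, "hf")  # the Summary tab's cfg.hf access no longer raises
```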

@maxjeblick maxjeblick mentioned this pull request Aug 1, 2023
@maxjeblick
Contributor

The PR is probably good to merge after hf has been added to the config.

@psinger
Collaborator Author

psinger commented Aug 3, 2023

Yes, I just noticed that I also need to adjust the summary model card now. On it.

@psinger psinger requested a review from maxjeblick August 3, 2023 12:15
Contributor

@maxjeblick maxjeblick left a comment


LGTM, thanks!

One small issue:
trust_remote_code is not populated in the model card; this should be easy to fix.
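The fix amounts to threading the config value into the model card template. A minimal sketch, with a purely illustrative template (not the repository's actual card text):

```python
# Hypothetical model card snippet showing trust_remote_code being populated
# from the experiment config; template and names are illustrative assumptions.

MODEL_CARD_USAGE = """\
model = AutoModelForSeq2SeqLM.from_pretrained(
    "{repo_id}",
    trust_remote_code={trust_remote_code},
)
"""


def render_model_card(repo_id: str, trust_remote_code: bool) -> str:
    """Fill the usage snippet with values taken from the experiment config."""
    return MODEL_CARD_USAGE.format(
        repo_id=repo_id, trust_remote_code=trust_remote_code
    )
```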

@psinger psinger merged commit 454b8df into main Aug 4, 2023
5 checks passed
@psinger psinger deleted the psi/seq2seq branch August 4, 2023 13:26
Development

Successfully merging this pull request may close these issues.

[Feature] Allow fine tuning T5 models?