想保留原有的对话能力并增加现有的问题处理对话哪种更适合呢？lora还是ptuning？？？我还有个疑问，#413 说到ptuning微调之后就只支持当前任务了，这种同样是对话的任务微调之后之前的对话能力是否也会变差？如果想保留原有的对话能力并增加现有的问题处理对话是不是使用lora更适合？ #542

cristianohello · 2023-04-12T06:28:26Z

Is there an existing issue for this?

I have searched the existing issues

Current Behavior

想保留原有的对话能力并增加现有的问题处理对话哪种更适合呢？lora还是ptuning？？？

我还有个疑问，#413 说到ptuning微调之后就只支持当前任务了，这种同样是对话的任务微调之后之前的对话能力是否也会变差？如果想保留原有的对话能力并增加现有的问题处理对话是不是使用lora更适合？

Expected Behavior

想保留原有的对话能力并增加现有的问题处理对话哪种更适合呢？lora还是ptuning？？？

我还有个疑问，#413 说到ptuning微调之后就只支持当前任务了，这种同样是对话的任务微调之后之前的对话能力是否也会变差？如果想保留原有的对话能力并增加现有的问题处理对话是不是使用lora更适合？

Steps To Reproduce

想保留原有的对话能力并增加现有的问题处理对话哪种更适合呢？lora还是ptuning？？？

我还有个疑问，#413 说到ptuning微调之后就只支持当前任务了，这种同样是对话的任务微调之后之前的对话能力是否也会变差？如果想保留原有的对话能力并增加现有的问题处理对话是不是使用lora更适合？

Environment

- OS:
- Python:
- Transformers:
- PyTorch:
- CUDA Support (`python -c "import torch; print(torch.cuda.is_available())"`) :

Anything else?

想保留原有的对话能力并增加现有的问题处理对话哪种更适合呢？lora还是ptuning？？？

我还有个疑问，#413 说到ptuning微调之后就只支持当前任务了，这种同样是对话的任务微调之后之前的对话能力是否也会变差？如果想保留原有的对话能力并增加现有的问题处理对话是不是使用lora更适合？

xxllp · 2023-04-13T12:44:45Z

我感觉无论是lora还是ptuning都会存在历史遗忘的问题，因为这二者本身还是微调的变形罢了

YYGe01 · 2023-04-14T01:51:32Z

实测，ptuning遗忘的很多，建议用lora，并且训练次数不能太多。

hanswang73 · 2023-04-14T02:28:29Z

实测，ptuning遗忘的很多，建议用lora，并且训练次数不能太多。

lora会遗忘吗？

YYGe01 · 2023-04-14T03:29:51Z

就算你从头到尾所有参数全部微调，也会有遗忘，lora相比ptuning会好点，但是ptuning做特定任务效果会好点。

hanswang73 · 2023-04-14T03:43:32Z

就算你从头到尾所有参数全部微调，也会有遗忘，lora相比ptuning会好点，但是ptuning做特定任务效果会好点。

多谢！

songsa1 · 2023-04-17T07:43:15Z

实测，ptuning遗忘的很多，建议用lora，并且训练次数不能太多

实测也会忘，好像不能步数太多

FrankWhh · 2023-04-18T11:35:13Z

实测，ptuning遗忘的很多，建议用lora，并且训练次数不能太多

实测也会忘，好像不能步数太多

但是步数少，感觉新东西学得不好，不知道是不是lora参数选的不对

cywjava · 2023-04-22T04:13:56Z

Lora 训练新知识，我试了一下几千步就可以了，要是几万步，反而推理结果更差

tqjack · 2023-04-25T00:48:00Z

你们batchsize都多大，accumulate是几

Vector-Cross · 2023-04-25T11:11:17Z

用ptuning，8000条数据，训练epoch到了5点几，感觉调的有点呆了

sun1092469590 · 2023-05-13T08:25:40Z

就算你从头到尾所有参数全部微调，也会有遗忘，lora相比ptuning会好点，但是ptuning做特定任务效果会好点。

你ptuning时用了多少条数据效果或不错？

energy888666 · 2023-08-04T07:16:30Z

那到底如何控制这个遗忘呢, 我是万全按照他[P-Tuning v2] 的微调参数都没动

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cristianohello commented Apr 12, 2023

xxllp commented Apr 13, 2023

YYGe01 commented Apr 14, 2023

hanswang73 commented Apr 14, 2023

YYGe01 commented Apr 14, 2023

hanswang73 commented Apr 14, 2023

songsa1 commented Apr 17, 2023

FrankWhh commented Apr 18, 2023

cywjava commented Apr 22, 2023

tqjack commented Apr 25, 2023

Vector-Cross commented Apr 25, 2023

sun1092469590 commented May 13, 2023

energy888666 commented Aug 4, 2023

Comments

cristianohello commented Apr 12, 2023

Is there an existing issue for this?

Current Behavior

Expected Behavior

Steps To Reproduce

Environment

Anything else?

xxllp commented Apr 13, 2023

YYGe01 commented Apr 14, 2023

hanswang73 commented Apr 14, 2023

YYGe01 commented Apr 14, 2023

hanswang73 commented Apr 14, 2023

songsa1 commented Apr 17, 2023

FrankWhh commented Apr 18, 2023

cywjava commented Apr 22, 2023

tqjack commented Apr 25, 2023

Vector-Cross commented Apr 25, 2023

sun1092469590 commented May 13, 2023

energy888666 commented Aug 4, 2023