How to fine tune on massive dataset (10k + dialog lines) #2227

atl333 · 2021-12-30T02:01:51Z

So I have a few 100k+ datasets, and I made some 10k-line ones while trying to fine-tune several models, but so far this is the only high-level programming project where I have seen full documentation. Is it possible to fine-tune my bot to, for example, reply on political topics using 10k+ lines of political conversations? Just for fun, of course; I don't expect it to come up with anything substantial xD

I have an RTX 2060 with CUDA 11.3, PyTorch, transformers, cuDNN 8, etc.

brightening-eyes · 2022-01-06T12:18:33Z

Yes.
Write a logic adapter. In it, get the input text, preprocess it, feed it to your neural network, and return the output.

MrChadMWood · 2022-01-31T19:41:28Z

> So I have a few 100k+ datasets

Any chance you'd be willing to post these?

Yaso2Go · 2022-03-20T18:08:31Z

> Yes. Write a logic adapter. In it, get the input text, preprocess it, feed it to your neural network, and return the output.

Can you please show us an example of how to do so?
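
A minimal sketch of the logic adapter approach described above, not a definitive implementation. It assumes ChatterBot's documented `LogicAdapter` interface (`can_process` and `process`). The names `NeuralNetworkAdapter`, `preprocess`, `generate_reply`, and the `model_fn` keyword are hypothetical and chosen for illustration; `generate_reply` is only a placeholder for wherever your fine-tuned model is called. The `except ImportError` branch defines small stand-ins so the sketch can be run without ChatterBot installed.

```python
import re

try:
    from chatterbot.conversation import Statement
    from chatterbot.logic import LogicAdapter
except ImportError:
    # Stand-ins (assumption) so the sketch runs without ChatterBot installed.
    class Statement:
        def __init__(self, text, **kwargs):
            self.text = text
            self.confidence = 0.0

    class LogicAdapter:
        def __init__(self, chatbot, **kwargs):
            self.chatbot = chatbot


def preprocess(text):
    """Hypothetical preprocessing: collapse whitespace and lowercase."""
    return re.sub(r"\s+", " ", text).strip().lower()


def generate_reply(prompt):
    """Placeholder for the fine-tuned model; swap in your own inference call."""
    return f"(model reply to: {prompt})"


class NeuralNetworkAdapter(LogicAdapter):
    """Hypothetical adapter that delegates every reply to a model function."""

    def __init__(self, chatbot, **kwargs):
        super().__init__(chatbot, **kwargs)
        # `model_fn` is an illustrative keyword, not a ChatterBot option.
        self.model_fn = kwargs.get("model_fn", generate_reply)

    def can_process(self, statement):
        # Let the model attempt an answer for every input.
        return True

    def process(self, input_statement, additional_response_selection_parameters=None):
        # Get the text, preprocess it, feed it to the model, return the output.
        prompt = preprocess(input_statement.text)
        response = Statement(text=self.model_fn(prompt))
        # Fixed confidence for simplicity; a real model could supply a score.
        response.confidence = 1.0
        return response
```

With ChatterBot installed, the adapter would typically be registered by its import path, for example `ChatBot("PoliticsBot", logic_adapters=["my_module.NeuralNetworkAdapter"])`, where `my_module` is a hypothetical module containing the class above.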
