How to fine tune on massive dataset (10k + dialog lines) #2227

Open
atl333 opened this issue Dec 30, 2021 · 3 comments

Comments

@atl333

atl333 commented Dec 30, 2021

So I have a few 100k+ datasets, and I made some 10k-line ones while trying to fine-tune several models, but so far this is the only high-level programming project where I have seen full documentation. Is it possible to fine-tune my bot to, for example, reply on political topics using 10k+ lines of political conversations? Just for fun, of course; I don't expect it to come up with anything substantial xD

I have an RTX 2060 with CUDA 11.3, PyTorch, transformers, cuDNN 8, etc.

@brightening-eyes

Yes.
Write a logic adapter: in it, get the input text, preprocess it, feed it to your neural network, and return the output.

@MrChadMWood

So I have a few 100k+ datasets

Any chance you'd be willing to post these?

@Yaso2Go

Yaso2Go commented Mar 20, 2022

Yes. Write a logic adapter: in it, get the input text, preprocess it, feed it to your neural network, and return the output.

Can you please show us an example of how to do so?
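
For reference, the steps suggested earlier in the thread (get the text, preprocess it, feed it to the network, return the output) could look roughly like the sketch below. This is not the maintainers' code, just one possible shape under some assumptions: `NeuralNetworkAdapter`, `preprocess`, `echo_model`, and the `model` keyword are all hypothetical names, and `echo_model` is only a placeholder standing in for a real fine-tuned network. The `try`/`except` stand-ins are there so the snippet runs even without ChatterBot installed.

```python
import re

try:
    from chatterbot.logic import LogicAdapter
    from chatterbot.conversation import Statement
except ImportError:
    # Minimal stand-ins (NOT the real classes) so the sketch runs
    # without ChatterBot installed.
    class LogicAdapter:
        def __init__(self, chatbot, **kwargs):
            self.chatbot = chatbot

    class Statement:
        def __init__(self, text, **kwargs):
            self.text = text
            self.confidence = 0


def preprocess(text):
    """Hypothetical cleanup step: collapse whitespace and trim."""
    return re.sub(r"\s+", " ", text).strip()


def echo_model(text):
    """Placeholder for a fine-tuned network; replace with real inference."""
    return "You said: " + text


class NeuralNetworkAdapter(LogicAdapter):
    """Hypothetical adapter that hands reply generation to a model."""

    def __init__(self, chatbot, **kwargs):
        super().__init__(chatbot, **kwargs)
        # Assumed keyword: any callable that maps a string to a string.
        self.model = kwargs.get("model", echo_model)

    def can_process(self, statement):
        # This sketch answers every input; narrow this if needed.
        return True

    def process(self, input_statement, additional_response_selection_parameters=None):
        text = preprocess(input_statement.text)   # get and clean the text
        response = Statement(text=self.model(text))  # feed it to the model
        response.confidence = 1.0  # fixed confidence, chosen arbitrarily here
        return response
```

If this class lived in a module of your own (say a hypothetical `my_adapters.py`), it would presumably be registered through the `logic_adapters` list when constructing the `ChatBot`, using its import path. The `echo_model` function would then be swapped for whatever wraps your own fine-tuned model, for example a function that calls a PyTorch or transformers model and returns the generated string.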


4 participants