Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

epfml / disco Public

Notifications You must be signed in to change notification settings
Fork 25
Star 142

Code
Issues 53
Pull requests 5
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Improve and rework GPT-tfjs #654

Open

4 of 10 tasks

JulienVig opened this issue Mar 27, 2024 · 0 comments

Open

4 of 10 tasks

Improve and rework GPT-tfjs #654

JulienVig opened this issue Mar 27, 2024 · 0 comments

Assignees

Labels

Related to Disco.js

Code that needs to be improved

Comments

Copy link

Collaborator

JulienVig commented Mar 27, 2024 •

edited

Loading

Here is a list of potential improvements for gpt-tfjs in Disco:

Create a compile method to initialize the optimizer (rather than initializing it when fitDataset is called). This ensures the optimizer state is persisted across multiple calls to fitDataset
Rework GPT-tfjs config (learning rate, number of iteration) as Disco parameters rather than being hard-coded
Implement save and load methods to save and re-use a trained model
Rename classes for better clarity and consistency, e.g. multiple classes and functions are called GPT
Assess whenever we can use TFJS' native fitDataset method rather than overriding it with a custom training loop
Assess whether we can use tf.CustomCallbackArgs rather than redefining an interface for TrainingCallbacks
Reading a text file with TF.js only supports reading line by line which is not ideal for LLM inputs, try implementing a file reader chunk by chunk rather than by lines
To use a trained model in Disco to generate text, we have to get the model instance through the aggregator. Implement a better interface to access the language generation API.
Make sure pad tokens are ignored in the loss computation (similarly to pytorch ignoring -100 as padding token)
There is memory leak in the model disposal, one tensor per attention layer is still not disposed after calling model.dispose. Edit: the federated/decentralized mechanism also allocates new tensors every round Garbage Collecting past node contributions #683

#656 and #657 should be addressed first

The text was updated successfully, but these errors were encountered:

tharvik reacted with hooray emoji

All reactions

🎉 1 reaction

JulienVig added rework

Code that needs to be improved

discojs Related to Disco.js labels

JulienVig self-assigned this

JulienVig mentioned this issue

Add tokenization and prompting API to GPT models #651

Merged

JulienVig mentioned this issue

Fix gpt-tfjs bugs, add tests and refactor code #658

Merged

6 tasks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Assignees

Labels

Related to Disco.js

Code that needs to be improved

Projects

None yet

Milestone

No milestone

Development

No branches or pull requests

1 participant

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.