Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update weights loading #7872

Merged
merged 9 commits into from
Aug 4, 2023
Merged

Conversation

fengwuyao
Copy link
Collaborator

Reflect the changes for saving weight to bin files PR.
This PR updates the way to parse the weights name.
To see the logs from the Cloud Build CI, please join either our discussion or announcement mailing list.

@fengwuyao fengwuyao marked this pull request as ready for review July 26, 2023 18:35
tfjs-layers/src/engine/container.ts Outdated Show resolved Hide resolved
tfjs-layers/src/engine/container.ts Show resolved Hide resolved
@fengwuyao fengwuyao merged commit aaa637e into tensorflow:master Aug 4, 2023
2 checks passed
pforderique added a commit that referenced this pull request Aug 7, 2023
* Implement position embedding

* Strip debug ops in jax conversion tests (#7889)

INTERNAL
This fixes an internal issue with jax tests. See cl/550054296.

* Update weights loading (#7872)

* Update weights loading

* fix tests

* remove

* fix

* fix comments

* fix lint

* Load python rules in tfjs-converter converters dir (#7892)

* Implement MultiHeadAttention Layer (#7875)

* Add spec for multi-head attention

* Add CachedMultiHeadAttention cache

* Fix typos

* Lint

* Add Transformer Decoder spec

* lint

* Add Einsum spec

* lint

* Remove unused type declaration

* Move helper functions outside EinsumDense class

* Implement Einsum Dense

* Address comments

* Implement MHA Layer

* Add masked softmax support

* Fix typo

* Check for undef and null

* Make buildFromSignature public

* Wrap softmax call in tf.tidy

* Implement position embedding

---------

Co-authored-by: Matthew Soulanille <msoulanille@google.com>
Co-authored-by: fengwuyao <131706622+fengwuyao@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants