Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

百川2 的max-z loss,为啥代码里只有前向部分,没有反向部分? #54

Closed
flower-with-safe opened this issue Oct 20, 2023 · 3 comments

Comments

@flower-with-safe
Copy link

我们团队自己实现了一个,可以一起讨论一下:
baichuan-inc/Baichuan2#220

@jerryli1981
Copy link
Collaborator

好的,收到。可以加入钉钉群,然后加下我的钉钉,我们一起开个小群聚焦下max z loss

@butterluo
Copy link

我们团队自己实现了一个,可以一起讨论一下: baichuan-inc/Baichuan2#220

貌似你forward的实现和baichuan2原版的实现不一样?

@flower-with-safe
Copy link
Author

我们团队自己实现了一个,可以一起讨论一下: baichuan-inc/Baichuan2#220

貌似你forward的实现和baichuan2原版的实现不一样?

哪里不一样了呀?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants