Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gemma: Open Models Based on Gemini Research and Technology, 2024 #1277

Open
AkihikoWatanabe opened this issue Apr 8, 2024 · 2 comments
Open

Comments

@AkihikoWatanabe
Copy link
Owner

AkihikoWatanabe commented Apr 8, 2024

https://storage.googleapis.com/deepmind-media/gemma/gemma-report.pdf

blog: https://blog.google/technology/developers/gemma-open-models/

@AkihikoWatanabe
Copy link
Owner Author

AkihikoWatanabe commented May 24, 2024

アーキテクチャはTransformer Decoderを利用。モデルのサイズは2Bと7B。
オリジナルのTransformer Decoderアーキテクチャから、下記改善を実施している:

image

@AkihikoWatanabe
Copy link
Owner Author

AkihikoWatanabe commented May 24, 2024

Mistral #1309 よりも高い性能を示している:
image
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant