Llama 2: Open Foundation and Fine-Tuned Chat Models, Hugo Touvron+, N/A, arXiv'23 #888

AkihikoWatanabe · 2023-07-22T09:31:48Z

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs.

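In this study, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) with 7 billion to 70 billion parameters.
Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases.
Our models outperform open-source chat models on most of the benchmarks we tested and, based on human evaluations of helpfulness and safety, may be a suitable substitute for closed-source models.
We provide a detailed description of our approach to fine-tuning and safety improvements for Llama 2-Chat so that the community can build on our work and contribute to the responsible development of LLMs.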

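This work develops and fine-tunes Llama 2, a large language model. Llama 2-Chat is specialized for dialogue and outperforms open-source chat models. The authors also work on safety improvements and aim to contribute to responsible LLM development.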

AkihikoWatanabe · 2023-07-22T09:36:43Z

AkihikoWatanabe · 2024-05-24T05:19:58Z

Llama and Llama 2 differ from the standard Transformer decoder in that RMSNorm is applied "before" the linear layers (pre-normalization).
Also, compared with Llama, Llama 2
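
To make the pre-normalization point above concrete, here is a minimal pure-Python sketch, not the actual Llama implementation. The function names (`rms_norm`, `linear`, `pre_norm_block`, `post_norm_block`), the `eps` value, and the use of a single matrix as a stand-in for a whole sub-layer are all assumptions made for illustration. The post-norm variant is included only for contrast; the original Transformer uses LayerNorm there, and RMSNorm is reused here so that only the ordering differs.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """Scale x by the reciprocal of its root mean square, then by a learned gain."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [w * v * inv_rms for v, w in zip(x, weight)]


def linear(x, matrix):
    """Matrix-vector product, a hypothetical stand-in for an attention or feed-forward sub-layer."""
    return [sum(w * v for w, v in zip(row, x)) for row in matrix]


def pre_norm_block(x, weight, matrix):
    """Pre-normalization: normalize the sub-layer input, then add the residual."""
    normed = rms_norm(x, weight)
    return [xi + yi for xi, yi in zip(x, linear(normed, matrix))]


def post_norm_block(x, weight, matrix):
    """Post-normalization ordering for contrast: sub-layer, residual add, then normalize."""
    summed = [xi + yi for xi, yi in zip(x, linear(x, matrix))]
    return rms_norm(summed, weight)


if __name__ == "__main__":
    x = [3.0, 4.0]
    gain = [1.0, 1.0]
    identity = [[1.0, 0.0], [0.0, 1.0]]
    print(pre_norm_block(x, gain, identity))
    print(post_norm_block(x, gain, identity))
```

In the pre-norm ordering the residual path carries the unnormalized input straight through, whereas in the post-norm ordering the block output itself is normalized.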
