OLMo: Accelerating the Science of Language Models, Dirk Groeneveld+, N/A, arXiv'24 #1250

AkihikoWatanabe · 2024-03-05T10:49:04Z

URL

Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details of their training data, architectures, and development undisclosed. Given the importance of these details in scientifically studying these models, including their biases and potential risks, we believe it is essential for the research community to have access to powerful, truly open LMs. To this end, this technical report details the first release of OLMo, a state-of-the-art, truly Open Language Model and its framework to build and study the science of language modeling. Unlike most prior efforts that have only released model weights and inference code, we release OLMo and the whole framework, including training data and training and evaluation code. We hope this release will empower and strengthen the open research community and inspire a new wave of innovation.

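As the commercial importance of LMs has grown, the most powerful models have become closed off, with their details undisclosed. This technical report therefore details the first release of OLMo, a truly open language model, together with a framework for building and studying the science of language modeling. OLMo releases not only the model weights but the whole framework, including training data and training and evaluation code, with the aim of strengthening the open research community and encouraging new innovation.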

AkihikoWatanabe · 2024-03-05T10:51:27Z

A truly Open Language Model that releases not only the model weights but also the training/evaluation code and the data. From AllenAI.

AkihikoWatanabe added action_wanted Pocket labels Mar 5, 2024

AkihikoWatanabe changed the title from "あ" to "OLMo: Accelerating the Science of Language Models, Dirk Groeneveld+, N/A, arXiv'24" Mar 5, 2024

AkihikoWatanabe added NLP LanguageModel OpenSource labels Apr 17, 2024