
For exploring the data and documenting its limitations


EleutherAI/pile-explorer


Exploring the Pile

This repository contains code for exploring the Pile and documenting its limitations.

Language Modeling Data Format

The Pile's data is stored in the lm_dataformat format, and the code in this repository is designed to operate on data stored that way. For documentation of the format, see the lm_dataformat repository.
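As a rough illustration of the kind of exploration this enables, the sketch below tallies documents and characters per source. It is not code from this repository. It assumes each record is one JSON object per line with a `text` field and an optional `meta` dictionary, and that `meta` may carry a `pile_set_name` key naming the source. The function names `iter_documents` and `summarize_by_source` are hypothetical, and the sketch takes already-decompressed lines rather than reading archives itself.

```python
import json
from collections import Counter
from typing import Dict, Iterable, Iterator, Tuple


def iter_documents(lines: Iterable[str]) -> Iterator[Tuple[str, dict]]:
    """Yield (text, meta) pairs from decompressed JSON-lines records."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # Assumed layout: {"text": "...", "meta": {...}}; meta may be absent.
        yield record["text"], record.get("meta") or {}


def summarize_by_source(
    docs: Iterable[Tuple[str, dict]],
) -> Dict[str, Dict[str, int]]:
    """Count documents and characters for each source name."""
    doc_counts: Counter = Counter()
    char_counts: Counter = Counter()
    for text, meta in docs:
        # "pile_set_name" is an assumed metadata key; fall back to "unknown".
        source = meta.get("pile_set_name", "unknown")
        doc_counts[source] += 1
        char_counts[source] += len(text)
    return {
        source: {"documents": doc_counts[source], "characters": char_counts[source]}
        for source in doc_counts
    }


# Made-up sample records, for illustration only.
sample_lines = [
    '{"text": "Hello world", "meta": {"pile_set_name": "Example-A"}}',
    '{"text": "A second document", "meta": {"pile_set_name": "Example-A"}}',
    '{"text": "No metadata here"}',
]

summary = summarize_by_source(iter_documents(sample_lines))
print(summary)
```

Because both functions work on plain iterables, the same summary could be fed from any reader that yields decompressed lines, so a large archive never needs to be held in memory at once.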
