Skip to content

Latest commit

 

History

History
69 lines (33 loc) · 7.2 KB

datacomp_models.md

File metadata and controls

69 lines (33 loc) · 7.2 KB

CommonPool and DataComp models

As part of DataComp, we trained models on CommonPool using various data filtering strategies. We release models for all four scales of the competition, small, medium, large and xlarge, corresponding to a pool size and number of samples seen of 12.8M, 128M, 1.28B and 12.8B, respectively.

The models are specified below, see our paper DataComp: In seearch of the next generation of multimodal datasets for more details.

xlarge scale models

large scale models

medium scale models

small scale models