Skip to content

lingoiitgn/TempUN

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 

Repository files navigation

TempUN Dataset

We introduce TempUN, an extensive and balanced dataset ranging from 10,000 BCE to 2100 CE. The dataset was obtained through web scraping from the following Our World in Data is curated based on global issues and priorities as delineated by the United Nations and EU. TempUN mitigates the bias towards immutable facts found in TempLAMA; only 16.13% of its facts remain unaltered, resulting in 83.87% of the facts being subject to change. TempUN encompasses approximately 462,894 records, from which a substantial number of temporal prompts, 9,497,502 (denoted as Large), have been derived. However, to accommodate the computational constraints of larger models, we also offer a compressed version of the dataset, which consists of a random selection of 1,907 instances corresponding to 104,130 prompts (referred to as Small).

Dataset Link: Drive

More Details in the paper (Link Coming soon)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published