Time is Encoded in the Weights of Finetuned Language Models

We release three language modeling datasets, over 500 time-specific models, and scripts for reproducing the main paper results.

Data

We provide processed WMT (2012-2020), Twitter (2015-2020), and arXiv (2006-2020) yearly language modeling splits, as well as monthly WMT splits (Jan. 2012-Dec. 2020), on Hugging Face.

We use the following processed downstream tasks from "Time Waits for No One!" (Luu et al., 2022):

Newsroom Summarization (NewsSum)
Newsroom Source Classification (NewsCls)
Tweet Political Affiliation Classification (PoliAff)
AI Publisher Classification (AIC)

To use the NewsSum and NewsCls tasks, first download the Newsroom dataset, then process it with the script from the "Time Waits for No One!" repo.

PoliAff text is omitted due to the Twitter License Agreement, but Luu et al. provide labels and tweet IDs in their repo.

AIC splits are also available at Luu et al.'s repo.

Models

We release all T5 models finetuned with time-specific data on Hugging Face.

Yearly downstream models are labeled as KaiNylund/t5-{t5-size}-{task}-{time}. For example, to load the T5-large model finetuned on 2018 PoliAff data, run:

from transformers import AutoModelForSeq2SeqLM
model = AutoModelForSeq2SeqLM.from_pretrained("KaiNylund/t5-770M-poli_aff-2018")

For language modeling tasks, the format is KaiNylund/t5-{t5-size}-lm-{dataset}-{time}. For instance, T5-small finetuned on October 2016 WMT language modeling data is at KaiNylund/t5-60M-lm-wmt-2016-9.
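The two naming patterns above can be captured in a small helper, which is handy when looping over many years or months. This is a minimal sketch rather than part of the released code: the function names are hypothetical, the size strings ("60M", "770M") and the "poli_aff" and "wmt" slugs are taken only from the two examples above, and the month suffix is passed through exactly as given, so check the model hub for the slugs and time suffixes that actually exist.

```python
from typing import Optional

OWNER = "KaiNylund"


def downstream_model_id(t5_size: str, task: str, year: int) -> str:
    """Build an ID following KaiNylund/t5-{t5-size}-{task}-{time}."""
    return f"{OWNER}/t5-{t5_size}-{task}-{year}"


def lm_model_id(t5_size: str, dataset: str, year: int,
                month: Optional[int] = None) -> str:
    """Build an ID following KaiNylund/t5-{t5-size}-lm-{dataset}-{time}.

    For monthly models the time part is "{year}-{month}"; the month value
    is used verbatim (hypothetical helper, no validation against the hub).
    """
    time = str(year) if month is None else f"{year}-{month}"
    return f"{OWNER}/t5-{t5_size}-lm-{dataset}-{time}"


print(downstream_model_id("770M", "poli_aff", 2018))
print(lm_model_id("60M", "wmt", 2016, 9))
```

The resulting strings can be passed directly to AutoModelForSeq2SeqLM.from_pretrained as in the example above.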

We also provide GPT2-small and XGLM-564M finetuned on yearly and monthly English WMT data (and German WMT data for XGLM), although we do not cover the finetuning process for these models in our paper.

Reproducing Experiments

The scripts in experiment_scripts reproduce individual experiments from the paper on a single GPU, and we also provide a file with usage examples. Because we do not directly release the downstream task datasets, most scripts require downloading an external dataset and then updating the paths to the training or evaluation files.

For example, to reproduce the T5-small task analogy experiments for NewsSum + WMT LM:

Install the conda environment with conda env create -f environment.yml
Download and process the Newsroom summarization dataset into yearly JSON evaluation splits with "text" and "summary" fields
Update the lines news_sum_eval_dir = "" and eval_file="${news_sum_eval_dir}${eval_year}" in time_vec_analogies.sh
Run bash ./experiment_scripts/time_vec_analogies.sh
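To illustrate the target layout from the second step (yearly splits with "text" and "summary" fields), here is a rough sketch of grouping records by year. It is not the processing script from the "Time Waits for No One!" repo, which remains the recommended route. The assumptions are labeled in the code: each input record is a dict with a "date" field whose first four characters are the year, the output is one JSON-lines file per year, and the "{year}.json" file name is hypothetical and must be adapted to whatever eval_file in time_vec_analogies.sh expects.

```python
import json
from collections import defaultdict
from pathlib import Path


def split_by_year(records, out_dir, date_field="date"):
    """Write one JSON-lines file per year, keeping only "text" and "summary".

    Assumptions (not confirmed by the repo): `date_field` starts with a
    4-digit year, and files are named "{year}.json".
    """
    by_year = defaultdict(list)
    for rec in records:
        year = str(rec[date_field])[:4]
        by_year[year].append({"text": rec["text"], "summary": rec["summary"]})

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    paths = {}
    for year, rows in sorted(by_year.items()):
        path = out / f"{year}.json"
        with path.open("w", encoding="utf-8") as f:
            for row in rows:
                f.write(json.dumps(row) + "\n")
        paths[year] = path
    return paths
```

Calling split_by_year(records, "news_sum_eval/") returns a mapping from year to the written file, and the output directory is then the value to use for news_sum_eval_dir.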

Due to the large number of evaluations (particularly for the monthly decay heatmap and time vector analogy alpha sweeps), we recommend running experiments in parallel. As a starting point, we provide our unorganized Slurm scripts in misc_slurm_jobs, although using these will require updating the file structure and Slurm account information in slurm_constants.py.
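One simple way to parallelize a sweep is to enumerate every evaluation configuration and give each worker a disjoint slice. The sketch below is independent of the scripts in misc_slurm_jobs: the function name, the alpha grid, and the year range are all hypothetical placeholders chosen for illustration.

```python
from itertools import product


def shard_grid(alphas, years, num_shards, shard_index):
    """Return the (alpha, year) configs assigned to one worker.

    Configs are dealt out round-robin, so shards are disjoint and
    together cover the full grid.
    """
    if not 0 <= shard_index < num_shards:
        raise ValueError("shard_index must be in [0, num_shards)")
    grid = list(product(alphas, years))
    return grid[shard_index::num_shards]


# Hypothetical sweep: 11 alpha values and 5 evaluation years.
alphas = [round(0.1 * i, 1) for i in range(11)]
years = list(range(2012, 2017))

for alpha, year in shard_grid(alphas, years, num_shards=8, shard_index=0):
    print(f"would evaluate alpha={alpha} on year={year}")
```

In a Slurm job array, shard_index could be read from the SLURM_ARRAY_TASK_ID environment variable so that each array task evaluates its own slice.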

Reference

@article{nylund2023time,
  title={Time is Encoded in the Weights of Finetuned Language Models},
  author={Nylund, Kai and Gururangan, Suchin and Smith, Noah A},
  journal={arXiv preprint arXiv:2312.13401},
  year={2023}
}
