Source code for our ICML 2023 paper *How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding*.
lda_bert_demo.ipynb
: trains a BERT model on LDA (topic modeling) data, plots its attention patterns, and saves other information such as attention score statistics, embedding dot products, and model parameter visualizations.
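For context, LDA data of this kind follows the standard topic-model generative process: each document draws a topic mixture, and each token draws a topic and then a word. Below is a minimal, self-contained sketch of that process in numpy; all hyperparameter names and values are illustrative assumptions, not the repo's actual settings.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hyperparameters (assumptions, not the repo's actual values):
num_topics = 5    # K: number of topics
vocab_size = 100  # V: vocabulary size
doc_len = 32      # tokens per document
alpha = 0.1       # Dirichlet prior over per-document topic mixtures
beta = 0.1        # Dirichlet prior over per-topic word distributions

# One categorical distribution over the vocabulary per topic.
topic_word = rng.dirichlet(beta * np.ones(vocab_size), size=num_topics)

def sample_document():
    """Sample one document from the LDA generative process."""
    # Per-document topic mixture theta ~ Dir(alpha).
    theta = rng.dirichlet(alpha * np.ones(num_topics))
    # For each token: draw a topic z ~ Cat(theta), then a word w ~ Cat(topic_word[z]).
    topics = rng.choice(num_topics, size=doc_len, p=theta)
    return np.array([rng.choice(vocab_size, p=topic_word[z]) for z in topics])

docs = np.stack([sample_document() for _ in range(4)])
print(docs.shape)  # (4, 32): a small batch of synthetic documents
```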
config/
: the config files are auto-generated when you run the notebook above with your chosen hyperparameters.
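For reference, a config file of this kind is just serialized hyperparameters. A hypothetical sketch (the key names and file name are assumptions, not the repo's actual schema) of how a notebook might write one:

```python
import json
import pathlib

# Hypothetical hyperparameters; the actual keys are set inside the notebook.
config = {
    "num_topics": 5,
    "vocab_size": 100,
    "doc_len": 32,
    "num_layers": 1,
    "num_heads": 1,
    "lr": 1e-4,
}

# Write the config under config/, matching the auto-generated layout.
pathlib.Path("config").mkdir(exist_ok=True)
with open("config/example.json", "w") as f:
    json.dump(config, f, indent=2)
```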
The code heavily borrows from dyck-transformer and dyckkm-learning. Thanks to these authors!
If you find our paper or code useful, please cite the paper and star this repo. Thank you!
Feel free to contact yuchenl4@cs.cmu.edu if you have any questions.
@misc{li2023transformers,
  doi = {10.48550/ARXIV.2303.04245},
  url = {https://arxiv.org/abs/2303.04245},
  author = {Li, Yuchen and Li, Yuanzhi and Risteski, Andrej},
  keywords = {Machine Learning (cs.LG), Computation and Language (cs.CL), Machine Learning (stat.ML), FOS: Computer and information sciences},
  title = {How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding},
  publisher = {arXiv},
  year = {2023},
  copyright = {arXiv.org perpetual, non-exclusive license}
}