TransformersConceptNet

Code and data associated with the paper "Can LLMs Facilitate Interpretation of Pre-trained Language Models?" (EMNLP 2023).

Data

The data can be downloaded from the project page.

Data Description

We apply agglomerative hierarchical clustering to contextualized representations and then annotate the resulting concepts using GPT. Our study covers several 12-layer transformer models, including BERT-cased, RoBERTa, XLNet, and ALBERT. For more details, please refer to the paper.
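
As an illustration of this pipeline, the following is a minimal sketch of clustering contextualized token representations, assuming scikit-learn, PyTorch, and Hugging Face transformers are installed; the sentences, layer choice, and number of clusters are placeholders and do not reflect the paper's actual settings:

import torch
from sklearn.cluster import AgglomerativeClustering
from transformers import AutoModel, AutoTokenizer

# Placeholder sentences; the paper uses a much larger corpus.
sentences = ["The bank raised interest rates.", "She sat on the river bank."]

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModel.from_pretrained("bert-base-cased", output_hidden_states=True)

tokens, token_reprs = [], []
with torch.no_grad():
    for sent in sentences:
        encoded = tokenizer(sent, return_tensors="pt")
        hidden_states = model(**encoded).hidden_states  # tuple: embeddings + 12 layers
        layer = hidden_states[6][0]                      # one layer's representations, chosen arbitrarily here
        for tok, vec in zip(tokenizer.convert_ids_to_tokens(encoded["input_ids"][0]), layer):
            if tok not in ("[CLS]", "[SEP]"):
                tokens.append(tok)
                token_reprs.append(vec.numpy())

# Group token representations into candidate concepts via agglomerative clustering.
clustering = AgglomerativeClustering(n_clusters=2).fit(token_reprs)
for tok, label in zip(tokens, clustering.labels_):
    print(tok, label)

Each resulting cluster is a candidate concept that can then be annotated, in the paper via GPT.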

Load data into the display

Clone this repository:

git clone https://github.com/qcri/TransformersConceptNet.git
cd TransformersConceptNet

Create and activate a virtual environment:

python -m venv .envs/tcn
source .envs/tcn/bin/activate

Install requirements:

pip install -r requirements.txt

Start the webapp using the following command:

python -u app.py -d <path-to-downloaded-data>

and visit http://localhost:8080 in your browser. The port and hostname can be passed as additional arguments to app.py.

Concept Probing and Neuron Analysis

We used the NeuroX package to train linear probes and perform neuron analysis. You can view the documentation of the package here.
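
For illustration, the sketch below shows what training a linear probe over token representations looks like, using scikit-learn's LogisticRegression as a generic stand-in rather than the NeuroX API itself; the data here is random placeholder data:

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# X: one contextualized representation per token (n_tokens x hidden_dim)
# y: the concept/property label assigned to each token
# Random placeholder data, purely for illustration.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 768))
y = rng.integers(0, 3, size=1000)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# A linear probe is a (regularized) logistic regression over the representations.
probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("probe accuracy:", probe.score(X_test, y_test))

# One simple form of neuron analysis: rank individual dimensions by the
# magnitude of their learned probe weights.
importance = np.abs(probe.coef_).max(axis=0)
top_neurons = np.argsort(importance)[::-1][:10]
print("top neurons:", top_neurons)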

Todo

  • Add cross-architectural comparison between the models to the display

Citation

If you use the dataset, please cite:

@inproceedings{mousi2023llms,
    title = "Can LLMs Facilitate Interpretation of Pre-trained Language Models?",
    author = "Mousi, Basel and
      Durrani, Nadir and
      Dalvi, Fahim",
    booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2023",
    publisher = "Association for Computational Linguistics",
    url = "https://browse.arxiv.org/pdf/2305.13386.pdf"
}
