Probing Search

This project contains code for the probing experiments in: Probing BERT for Ranking Abilities.

By performing layer-wise probing on the LLM, we can find the distribution of certain ranking related properties. This gives us an idea where ranking related information is stored in the model. We perform this analysis with the raw pre-trained model and one fine-tuned for ranking.

Distribution of properties before fine-tuning. Distribution of properties after fine-tuning.

Note: This readme serves as a quickstart guide for running experiments. For further details on the overall project structure and intended use, please visit here.

Requirements

The code is tested using python 3.7 in a virtual anaconda environment. All required packages can be found in environment.yml and installed using anaconda like so:

conda env create -f environment.yml

The packages listed can also be installed manually using pip. (Note: cudatoolkit is optional and only needed for gpu training. Same goes for wandb if you do not wish to use wandb for logging)

Experiment Configuration

This project uses hydra for composing a training configuration from multiple sub-configuration modules. All configuration files can be found in conf/ and its subdirectories, where config.yaml is the main configuration file and subdirectories contain possible sub-configurations.

Even though yaml files define the default configuration and its structure, all training parameters can be overriden via command line arguments when running the train.py script and sub-configurations can be replaced allowing easy plug and play of training components.

Examples

Replace the datamodule's default dataset configuration with the ontonotes_ner configuration -> use ontonotes_ner as dataset:

python train.py datamodule/dataset=ontonotes_ner

Sweep over learning rates (multirun):

python -m train.py loop.optimizer.lr=0.01,0.1,0.2

Do a grid search over all learning rate and batch size combinations:

python -m train.py loop.optimizer.lr=0.01,0.1 training.batch_size=32,64,128

Project Structure

The configuration structure mirrors the project's structure:

conf/
├── datamodule
│   ├── dataset
            # dataset's task type  
│   │   └── task
        # applied before passing to model
│   └── preprocessor
|
    # defines the optimization process: loop, loss, optimizer 
├── loop (pl.LightningModule)
|
    # model to be trained in loop
└ ── model (torch.nn.Module)
    # submodules for the probing-pair model
    ├── pooler
    ├── probe
    └── subject_model

The training is instantiated in train.py.

Name		Name	Last commit message	Last commit date
Latest commit History 220 Commits
analysis		analysis
conf		conf
datamodule		datamodule
logger		logger
model		model
res		res
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
extract_body.py		extract_body.py
loop.py		loop.py
metrics.py		metrics.py
preprocessor.py		preprocessor.py
procedures.py		procedures.py
train.py		train.py

yolomeus/probing-search

Folders and files

Latest commit

History

Repository files navigation

Probing Search

Requirements

Experiment Configuration

Examples

Project Structure

About

Resources

Stars

Watchers

Forks

Languages