- Collected training data.
- Corrupted ('spoiled') the data by breaking the gender of adjectives.
- Trained two Russian BERT models.
- Annotated data from RuSentEval with Stanza, either per sentence or per word.
- Ran three types of probing experiments (on the [CLS] token, on the mean sentence embedding, and per token), relying largely on NeuroX.
- Compared the results.
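
The corruption step in the list above can be illustrated with a toy sketch. This is not the procedure used in this project: the ending map `ENDING_SWAP`, the function `corrupt_gender`, and the choice to rewrite only `-ый` and `-ое` endings are hypothetical, and the input is assumed to be already tagged with Universal POS tags (the tag set Stanza produces). It only shows the general idea of breaking adjective gender while leaving the rest of the sentence intact.

```python
from typing import List, Tuple

# Hypothetical toy map: masculine (-ый) and neuter (-ое) nominative
# endings are rewritten to the feminine ending (-ая).
ENDING_SWAP = {"ый": "ая", "ое": "ая"}


def corrupt_gender(tagged: List[Tuple[str, str]]) -> List[str]:
    """Return the tokens with the gender ending of adjectives swapped.

    `tagged` is a list of (word, upos) pairs; only words tagged ADJ
    whose ending appears in ENDING_SWAP are changed.
    """
    corrupted = []
    for word, upos in tagged:
        if upos == "ADJ":
            for old, new in ENDING_SWAP.items():
                if word.endswith(old):
                    word = word[: -len(old)] + new
                    break
        corrupted.append(word)
    return corrupted


# "новый дом стоит" ("a new house stands"): the masculine adjective
# becomes feminine and no longer matches the masculine noun "дом".
sentence = [("новый", "ADJ"), ("дом", "NOUN"), ("стоит", "VERB")]
print(corrupt_gender(sentence))
```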

Source files for probing and an example notebook are included.
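
The three probing setups differ only in which vectors are passed to the probing classifier. The sketch below shows that difference on plain Python lists; it does not use NeuroX or the project's own `src` code. The function names, the convention that `[CLS]` is the first and `[SEP]` the last non-padding position, and the choice to average over all non-padding positions are assumptions made for illustration.

```python
from typing import List, Sequence

Vector = List[float]


def cls_embedding(hidden: Sequence[Vector]) -> Vector:
    """Sentence representation: the vector at position 0 ([CLS])."""
    return list(hidden[0])


def mean_embedding(hidden: Sequence[Vector], mask: Sequence[int]) -> Vector:
    """Sentence representation: average over non-padding positions."""
    kept = [vec for vec, m in zip(hidden, mask) if m]
    if not kept:
        raise ValueError("attention mask selects no positions")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def token_embeddings(hidden: Sequence[Vector], mask: Sequence[int]) -> List[Vector]:
    """Per-token representations: one vector per real token.

    Padding is dropped, then the first and last remaining positions
    (assumed to be [CLS] and [SEP]) are removed.
    """
    real = [vec for vec, m in zip(hidden, mask) if m]
    return [list(vec) for vec in real[1:-1]]


# Toy last-layer output: [CLS], two word pieces, [SEP], one padding slot.
hidden = [[1.0, 0.0], [2.0, 2.0], [4.0, 6.0], [1.0, 0.0], [9.0, 9.0]]
mask = [1, 1, 1, 1, 0]

print(cls_embedding(hidden))           # input for the CLS probe
print(mean_embedding(hidden, mask))    # input for the mean-embedding probe
print(token_embeddings(hidden, mask))  # inputs for the per-token probe
```

In the first two setups each sentence yields one vector and one label, while in the per-token setup each token yields its own vector and needs its own label, which is why the annotation is needed both per sentence and per word.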