aalto-speech/Wav2vec2Interpretation

Wav2vec2Interpretation

Scripts and additional images for the article "Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model"

UMAP visualization of the embeddings

Animated picture of the visual interpretation

Visualization of the CNN outputs

You can find the visualizations of the embeddings produced by the CNN component in pics/cnn_*.svg

Visualization of the pretrained Transformer's outputs

You can find the high-resolution visualizations of the embeddings produced by the pre-trained Transformer component in pics/pre_*.[svg/eps]

Visualization of the finetuned Transformer's outputs

You can find the high-resolution visualizations of the embeddings produced by the fine-tuned Finnish Transformer component in pics/fine_*.[svg/eps]

Gender, age and speaker information

Pictures marked with utt2age show how age information is embedded in the models, pictures marked with utt2speaker show how well the models can differentiate between speakers, and pictures marked with utt2gender visualize the gender information in the embeddings.
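The naming conventions above (the cnn_, pre_ and fine_ prefixes and the utt2age, utt2speaker and utt2gender markers) can be used to sort the pictures programmatically. The following is a minimal sketch, not part of the repository's scripts: the function names `classify_picture` and `index_pictures` are hypothetical, and it assumes the markers appear somewhere in the file name.

```python
from collections import defaultdict
from pathlib import Path

# File-name prefixes described in this README, mapped to descriptive labels.
COMPONENTS = {
    "cnn_": "CNN",
    "pre_": "pre-trained Transformer",
    "fine_": "fine-tuned Transformer",
}

# Markers described in this README, mapped to the attribute they visualize.
# Assumption: the marker appears as a substring of the file name.
ATTRIBUTES = {
    "utt2age": "age",
    "utt2speaker": "speaker",
    "utt2gender": "gender",
}


def classify_picture(filename):
    """Return (component, attribute) for a picture name; either may be None."""
    name = Path(filename).name
    component = next(
        (label for prefix, label in COMPONENTS.items() if name.startswith(prefix)),
        None,
    )
    attribute = next(
        (label for marker, label in ATTRIBUTES.items() if marker in name),
        None,
    )
    return component, attribute


def index_pictures(pics_dir="pics"):
    """Group the .svg and .eps files in pics_dir by (component, attribute)."""
    index = defaultdict(list)
    for path in sorted(Path(pics_dir).iterdir()):
        if path.suffix in {".svg", ".eps"}:
            index[classify_picture(path.name)].append(path.name)
    return dict(index)
```

For example, `classify_picture("fine_utt2gender.svg")` returns `("fine-tuned Transformer", "gender")`. That file name is made up for illustration; check the pics/ directory for the actual names.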
