Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
Updated Aug 7, 2024 - Python
Explainability of Deep Learning Models
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
Project completely rebuilt in the web v2