Skip to content

v0.0.3: Add support for probe training, interchange intervention training

Compare
Choose a tag to compare
@frankaging frankaging released this 12 Jan 11:52
· 296 commits to main since this release

Descriptions:

  1. Add generic activation collection intervention to collect activations for probe training. It is integrated with other interventions. You can intervene and then collect.
  2. Support interchange intervention training.
  3. More tutorials are provided.
  4. Add CONTRIBUTING.md for dev rules.
  5. Change to relative imports.