Skip to content

Latest commit

 

History

History
18 lines (9 loc) · 594 Bytes

ontrastive_circuit_sharpening.md

File metadata and controls

18 lines (9 loc) · 594 Bytes

contrastive circuit sharpening

inspiration

analogy

  • to human learning: when we get the answer wrong, it helps a lot to be told what we could've done differently.

procedure

  1. for a given contrastive target, generate multiple samples.
  2. rank them and use a continuous transform/approximation of that ranking as a contrastive loss. this should be structured such that only the single best or handful of really good samples are given good scores