Add Particle Filter #673

rlouf · 2024-02-16T10:14:14Z

See Nicolas Chopin's book for a really nice introduction to the topic. The algorithms consists in carrying $N$ particles for each sequence in the batch, and at each step to:

Sample a new token for each particle using the next-token logits.
Resample the particles.

We use the multinomial resampling function in this first PR, although it is known to have very large variance. To make the implementation easier we combine (1) and (2) in a single step, similarly to what we do with beam search.

Note that there is a subtlety when doing structured generation. We can think of the simple following scheme to sample from the distribution of sequences that follow the structure:

Move particles by one step using the unbiased next-token logits;
Set the weight of each invalid particle to $-\infty$
Resample

But this can be very inefficient. Instead, we move particles using a specific proposal: using the biased next-token logits. Since this is not exactly sampling from the original distribution we need to resample the particles using the factor $P_i / \tilde{P}_i$ as a weight where $P_i$ is the unbiased probability of token $i$ and $P_i$ the biased probability of token $i$ (importance sampling).

Note: I am wondering if we should correct the Beam Search algorithm as well.

rlouf · 2024-02-29T12:28:42Z

Doing this I started to wonder if we shouldn't see and implement greedy and multinomial sampling as particular cases of more general samplers (resp. a form of beam search and a form of particle filtering).

rlouf added enhancement transformers Linked to the `transformers` integration samplers labels Feb 16, 2024

rlouf marked this pull request as draft February 16, 2024 10:56

rlouf changed the title ~~Add Sequential Monte Carlo Sampler~~ Add Particle Filter Feb 28, 2024

dottxt-ai deleted a comment from lapp0 Feb 28, 2024

rlouf mentioned this pull request Feb 28, 2024

Add integration with Hugging Face transformers #713

Closed

rlouf force-pushed the smc-sampler branch 3 times, most recently from b78e8b3 to b33e645 Compare February 29, 2024 13:46

Add Particle Filter for sequences

b33e645

rlouf closed this Jun 19, 2024

rlouf deleted the smc-sampler branch November 4, 2024 18:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Particle Filter #673

Add Particle Filter #673

rlouf commented Feb 16, 2024 •

edited

Loading

rlouf commented Feb 29, 2024

Add Particle Filter #673

Add Particle Filter #673

Conversation

rlouf commented Feb 16, 2024 • edited Loading

rlouf commented Feb 29, 2024

rlouf commented Feb 16, 2024 •

edited

Loading