Skip to content

Understanding MaskedMLP #30

Answered by francois-rozet
zqy767 asked this question in Q&A
Nov 21, 2023 · 2 comments · 11 replies
Discussion options

You must be logged in to vote

Hello @zqy767, this a very good question! I'll start by explaining why MaskedMLP is necessary in autoregressive transformations and then how it is implemented in Zuko. I also invite you to take a look at discussion #16, whose subject is related.

Why

Let $x$ be a vector in $\mathbb{R}^n$. An autoregressive transformation is a mapping $y = f(x) \in \mathbb{R}^n$ such that the $i$-th element of $y$ is a bijective univariate transformation of the $i$-th element of $x$, conditioned on the preceding elements. That is $y_i = f_i(x_i \mid h_i(x_{1:i-1}))$ where $x_{1:i} = (x_1, x_2, \dots, x_i)$ and $h_i$ returns the parameters of the univariate transformation $f_i$. $h_i$ is typically a neural n…

Replies: 2 comments 11 replies

Comment options

You must be logged in to vote
3 replies
@zqy767
Comment options

@francois-rozet
Comment options

@zqy767
Comment options

Comment options

You must be logged in to vote
8 replies
@CaioDaumann
Comment options

@francois-rozet
Comment options

@francois-rozet
Comment options

@CaioDaumann
Comment options

@francois-rozet
Comment options

Answer selected by francois-rozet
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants