
Why does the normalization apply only to the input? #1

Closed
yaoweihu opened this issue Dec 12, 2019 · 1 comment

yaoweihu commented Dec 12, 2019

Hello, I've looked at your code and noticed that you only apply the log and normalization to in_times. I don't understand why you don't apply them to both in_times and out_times?

shchur (Owner) commented Dec 16, 2019

Since we are using log-likelihood as the evaluation metric, applying transformations to the inter-event times (e.g. scaling or taking the logarithm) will change the results. The log-likelihood is defined as \sum_i \log p^*(\tau_i), where p^*(\tau_i) is the conditional density at point \tau_i; we are summing the log-densities of all samples in the dataset. If we transform all the inter-event times \tau_i (e.g. scale them or apply the log), the densities also change according to the change-of-variables formula.
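A small numerical check of this point (a hypothetical example using an exponential density, not code from this repo): if we rescale the data and account for the rescaling correctly via the change-of-variables formula, the total log-likelihood shifts by exactly N * log(c), so log-likelihoods computed on transformed times are not comparable to those on the original times.

```python
import numpy as np

rng = np.random.default_rng(0)
rate = 2.0
tau = rng.exponential(1.0 / rate, size=1000)  # inter-event times

def exp_logpdf(x, lam):
    # log-density of Exp(lam): log(lam) - lam * x
    return np.log(lam) - lam * x

# Log-likelihood of the original inter-event times
ll_orig = exp_logpdf(tau, rate).sum()

# Scale the inter-event times: tau_scaled = tau / c
c = 10.0
tau_scaled = tau / c

# Density of the scaled variable (change of variables):
# p_scaled(y) = c * p(c * y)  =>  log p_scaled(y) = log(c) + log p(c * y)
ll_scaled = (np.log(c) + exp_logpdf(tau_scaled * c, rate)).sum()

# The two log-likelihoods differ by exactly N * log(c)
assert np.isclose(ll_scaled - ll_orig, len(tau) * np.log(c))
```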

Instead, we do the following. All the models considered in our paper are defined as normalizing flows (i.e. a sequence of transformations of a base density).

z_0 ~ p(z_0)
z_1 = g_1(z_0)
...
z_M = g_M(z_{M-1})
tau = f(z_M)

As the final transformation (f in the pseudocode above) we apply scaling / exponentiation (see decoders.py). This way:

  1. we obtain a distribution over the original inter-event times;
  2. the models are easier to train, since the distribution over z_M should have zero mean and unit variance.
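A minimal sketch of what such a final transform could look like (hypothetical names and statistics; see decoders.py for the actual implementation): z_M is roughly standard normal, and the last step un-normalizes in log-space and exponentiates, so the output is a positive inter-event time on the original scale.

```python
import numpy as np

# Assumed (hypothetical) normalization statistics of log(in_times)
mean_log_tau, std_log_tau = 0.5, 1.3

def final_transform(z_m):
    # scaling: undo the normalization in log-space
    log_tau = z_m * std_log_tau + mean_log_tau
    # exponentiation: map back to positive inter-event times
    return np.exp(log_tau)

z_m = np.random.default_rng(1).normal(size=5)  # z_M ~ approximately N(0, 1)
tau = final_transform(z_m)
assert (tau > 0).all()  # inter-event times are strictly positive
```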

Let me know if this explanation doesn't make sense to you and I will try to clarify it.

@shchur shchur closed this as completed May 4, 2020