
Perplexity: add clipping and from_logits#47

Merged
jshin1394 merged 1 commit into google:main from jeffcarp:fix-perplexity
Mar 31, 2025

Conversation

@jeffcarp (Collaborator) commented Mar 27, 2025

It was pointed out that Perplexity returns NaNs for negative values. This is because our implementation did not clip logit values to [0, 1], whereas the Keras implementation does. [1]

Even with that fix, the tests were failing because Keras defaults to the TensorFlow version of the metric, which applies softmax to the outputs unconditionally [2], unlike the JAX implementation which does not. [3]

Also:

  • Added a from_logits arg, similar to Keras, for users who want to pass raw logits and have us apply softmax internally.
  • Forced all Keras metrics in tests to use the JAX backend for parity.

[1] https://github.com/keras-team/keras/blob/3f8b065e82b17884bd43fcfbd4bd79f18a7019fe/keras/src/backend/jax/nn.py#L582
[2] https://www.tensorflow.org/api_docs/python/tf/nn/sparse_softmax_cross_entropy_with_logits
[3] https://github.com/keras-team/keras/blob/3f8b065e82b17884bd43fcfbd4bd79f18a7019fe/keras/src/backend/jax/nn.py#L578-L579
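The two fixes can be sketched roughly as follows. This is a hypothetical helper (shown with NumPy for brevity), not the actual implementation from this PR; the function name, argument names, and epsilon value are illustrative:

```python
import numpy as np

def perplexity(targets, preds, from_logits=False, eps=1e-7):
    """Sketch of a perplexity metric with clipping and a from_logits option.

    targets: integer class ids, shape (N,)
    preds:   probabilities or raw logits, shape (N, num_classes)
    """
    if from_logits:
        # Raw logits: normalize to probabilities with a numerically
        # stable softmax before taking logs.
        z = preds - preds.max(axis=-1, keepdims=True)
        preds = np.exp(z) / np.exp(z).sum(axis=-1, keepdims=True)
    # Clip so negative or zero values cannot produce NaN/inf in the log.
    preds = np.clip(preds, eps, 1.0)
    # Log-probability assigned to each true class.
    log_p = np.log(np.take_along_axis(preds, targets[..., None], axis=-1))
    # Perplexity = exp(mean cross-entropy).
    return np.exp(-np.mean(log_p))
```

As a sanity check, a uniform distribution over V classes should yield a perplexity of exactly V, and negative inputs should now give a finite value instead of NaN.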

@jeffcarp jeffcarp requested a review from jshin1394 March 27, 2025 23:39
@jeffcarp jeffcarp marked this pull request as draft March 27, 2025 23:48
@jeffcarp (Collaborator, Author) commented Mar 27, 2025

Looking into the test failures... looks like it only fails when the whole test suite is run?

@jeffcarp jeffcarp marked this pull request as ready for review March 28, 2025 23:37
@jeffcarp (Collaborator, Author) commented

Found the issue: when Keras is imported by other test files first, it doesn't have `KERAS_BACKEND` set correctly.
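Keras reads the `KERAS_BACKEND` environment variable once, at first import, so the variable must be set before any test file imports Keras; setting it afterwards has no effect. A minimal sketch of forcing the JAX backend at the top of a test entry point (assuming nothing has imported Keras yet):

```python
import os

# Must run before the first `import keras` anywhere in the process:
# the backend choice is frozen at import time.
os.environ["KERAS_BACKEND"] = "jax"
```

Equivalently, `KERAS_BACKEND=jax` can be exported in the shell that launches the test suite.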

@jshin1394 jshin1394 merged commit 79fefa2 into google:main Mar 31, 2025
3 checks passed
