Add KL Divergence helper #7062
base: main
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅
Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7062      +/-   ##
==========================================
+ Coverage   90.17%   90.18%   +0.01%
==========================================
  Files         101      103       +2
  Lines       16932    16952      +20
==========================================
+ Hits        15269    15289      +20
  Misses       1663     1663
This looks very good @ferrine! One thing that you definitely need to do before you can merge this is to add a page about this to the documentation. Maybe make a folder for logprob and include a subfolder for the KL divergence, since it will start to grow as more divergences get added.
pymc/logprob/kl_divergence.py
Outdated
    q_inputs: List[TensorVariable],
    p_inputs: List[TensorVariable],
):
    _, _, _, q_mu, q_sigma = q_inputs
Should probably consider size like moment does?
I found that our …
I am also confused about the design choice, since …
The base functionality should be in logprob, but the specific implementations should be in …
You can, but it won't work for things that look like Distributions but are just helpers to create distributions, like …
Moved kl into a private pymc.distribution._stats because these are functions that will never be used by anyone.
I don't like the underscore, why not just …
@ricardoV94 can you please revisit the review? Did I miss something?
Tests are failing with an import issue.
How many pairs of distributions do we expect to actually be able to support?
Many of them: https://www.tensorflow.org/probability/api_docs/python/tfp/distributions/kl_divergence
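Supporting many pairs suggests pairwise dispatch. The sketch below is hypothetical plain Python, not PyMC's actual API (the names `register_kl`, `kl_div`, and `_KL_REGISTRY` are illustrative): it keys implementations on the pair of op types, mirroring how a `_kl_div` dispatcher could select an implementation from the two distributions' ops.

```python
# Hypothetical sketch of pairwise KL dispatch: implementations are keyed on
# the (type(q_op), type(p_op)) pair, so new distribution pairs can be added
# incrementally without touching the dispatcher.
_KL_REGISTRY = {}


def register_kl(q_cls, p_cls):
    """Decorator registering a KL implementation for a (q, p) op-type pair."""

    def decorator(func):
        _KL_REGISTRY[(q_cls, p_cls)] = func
        return func

    return decorator


def kl_div(q_op, p_op, *args, **kwargs):
    """Look up and call the KL implementation registered for this pair."""
    try:
        impl = _KL_REGISTRY[(type(q_op), type(p_op))]
    except KeyError:
        raise NotImplementedError(
            f"No KL registered for {type(q_op).__name__} vs {type(p_op).__name__}"
        )
    return impl(q_op, p_op, *args, **kwargs)
```

Unregistered pairs fail loudly with `NotImplementedError`, which keeps the supported set explicit as more divergences get added.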
    _, _, _, q_mu, q_sigma = q_inputs
    _, _, _, p_mu, p_sigma = p_inputs
    diff_log_scale = pt.log(q_sigma) - pt.log(p_sigma)
    return (
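The truncated snippet computes the Normal–Normal KL in closed form. As a sanity reference, here is a self-contained NumPy sketch of that closed form (the standalone function and its name are illustrative; the PR builds the equivalent expression with PyTensor ops):

```python
import numpy as np


def kl_normal_normal(q_mu, q_sigma, p_mu, p_sigma):
    """Closed-form KL(q || p) for univariate normals:

        KL = log(p_sigma / q_sigma)
             + (q_sigma**2 + (q_mu - p_mu)**2) / (2 * p_sigma**2)
             - 1/2

    Broadcasts elementwise over array-valued parameters.
    """
    q_mu, q_sigma, p_mu, p_sigma = map(np.asarray, (q_mu, q_sigma, p_mu, p_sigma))
    diff_log_scale = np.log(q_sigma) - np.log(p_sigma)  # as in the snippet above
    return (
        -diff_log_scale
        + (q_sigma**2 + (q_mu - p_mu) ** 2) / (2.0 * p_sigma**2)
        - 0.5
    )
```

As quick checks, KL(q‖q) is 0, and KL(N(1,1)‖N(0,1)) = 0.5.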
May want to broadcast to size, like we do with moment, if someone does kl_div(pm.Normal.dist(shape=5), pm.Normal.dist(mu=1)).
This is still relevant: you're ignoring batch dimensions encoded in the size parameter.
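One hedged way to honor those batch dimensions, sketched in NumPy rather than PyTensor (the helper name, and treating each distribution's size as a shape tuple, are assumptions for illustration only):

```python
import numpy as np


def broadcast_kl_to_batch(kl_value, q_batch_shape, p_batch_shape):
    """Hypothetical helper: broadcast a pointwise KL term to the combined
    batch shape of the two distributions, so that e.g.
    kl_div(pm.Normal.dist(shape=5), pm.Normal.dist(mu=1)) yields 5 values
    instead of silently dropping the batch dimension.
    """
    batch_shape = np.broadcast_shapes(q_batch_shape, p_batch_shape)
    return np.broadcast_to(np.asarray(kl_value), batch_shape)
```

A scalar KL between `shape=(5,)` and scalar distributions then comes back as five identical values, matching what a user iterating over batches would expect.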
Just did the rebase; anything we can add or change on top of that?
    q_inputs: List[TensorVariable],
    p_inputs: List[TensorVariable],
This is wrong: RVs have non-tensor inputs as well.
-    q_inputs: List[TensorVariable],
-    p_inputs: List[TensorVariable],
+    q_inputs: List[Variable],
+    p_inputs: List[Variable],
kl = _kl_div(
    q_rv.owner.op,
    p_rv.owner.op,
    q_inputs=q_rv.owner.inputs,
Perhaps pass the node instead of inputs. Allows stuff like op.dist_params(node) and op.size_param(node) inside the dispatch functions. Not sure though
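To illustrate that suggestion with stub classes (none of these names are PyTensor's real API; this only shows why passing the node is more convenient than passing a flat input list):

```python
# Stub classes illustrating node-based dispatch. A random-variable node's
# inputs begin with non-parameter variables (rng, size, dtype here, per the
# `_, _, _, mu, sigma = inputs` unpacking above); when the whole node is
# passed, the op itself can slice out just the distribution parameters.
class FakeApplyNode:
    def __init__(self, inputs):
        self.inputs = inputs


class FakeNormalOp:
    n_non_param_inputs = 3  # rng, size, dtype

    def dist_params(self, node):
        """Return only the distribution parameters of `node`."""
        return node.inputs[self.n_non_param_inputs:]


op = FakeNormalOp()
node = FakeApplyNode(["rng", "size", "dtype", 0.0, 1.0])
mu, sigma = op.dist_params(node)  # dispatch code never touches rng/size/dtype
```

Each dispatch function then stays agnostic to how many bookkeeping inputs precede the parameters, which is exactly what hard-coded `_, _, _, mu, sigma` unpacking cannot guarantee.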
📚 Documentation preview 📚: https://pymc--7062.org.readthedocs.build/en/7062/