
Evaluation of Mutual Information between Summary Statistics and Physical Parameters

This repository provides a detailed implementation of the paper Evaluating Summary Statistics with Mutual Information for Cosmological Inference, along with other experiments and code related to this topic.

Basic Idea

In this research, we study the application of mutual information (MI) to assess the effectiveness of various summary statistics in cosmological inference. Our approach utilizes the Barber-Agakov (BA) lower bound and normalizing-flow-based variational distributions. For more details, please refer to our paper.
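Below is a minimal sketch of the BA bound, assuming the summaries and parameters are available as PyTorch tensors. A conditional diagonal Gaussian stands in for the normalizing-flow variational distribution used in the paper, and the layer sizes and training settings are illustrative, not the repository's exact setup.

```python
# Barber-Agakov (BA) lower bound sketch:
#   I(theta; s) >= H(theta) + E_{p(theta, s)}[log q(theta | s)]
# A conditional diagonal Gaussian (MLP predicting mean and log-std) stands in
# for the normalizing-flow variational distribution q(theta | s).
import torch
import torch.nn as nn

class GaussianPosterior(nn.Module):
    def __init__(self, summary_dim, param_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(summary_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 2 * param_dim),
        )

    def log_prob(self, theta, summary):
        mean, log_std = self.net(summary).chunk(2, dim=-1)
        dist = torch.distributions.Normal(mean, log_std.exp())
        return dist.log_prob(theta).sum(-1)

def ba_lower_bound(model, theta, summary, prior_entropy):
    """MI lower bound = H(theta) + E[log q(theta | s)]."""
    return prior_entropy + model.log_prob(theta, summary).mean()

def train(model, theta, summary, epochs=200, lr=1e-3):
    """Maximize E[log q(theta | s)], i.e. minimize the negative log-likelihood."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        loss = -model.log_prob(theta, summary).mean()
        loss.backward()
        opt.step()
    return model
```

Because the parameter prior is known in a simulation-based setting, its entropy H(theta) can be computed analytically or estimated once and reused.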

Experiments

We conduct two experiments to demonstrate our method using the BA lower bound:

We show that MI can correctly assess the effectiveness of different summary statistics in a given inference task.
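As a toy illustration of this comparison (not the repository's actual experiment code), the BA sketch from the Basic Idea section could be used to rank candidate summaries. The summary names, dimensions, and prior entropy below are placeholders, and the snippet reuses GaussianPosterior, train, and ba_lower_bound from that sketch.

```python
# Hypothetical ranking of two summary statistics with the BA-bound sketch above.
# All data are random placeholders standing in for real simulation outputs.
import torch

theta = torch.randn(5000, 2)  # placeholder physical parameters
summaries = {
    "power spectrum": torch.randn(5000, 10),       # placeholder summary A
    "scattering transform": torch.randn(5000, 50),  # placeholder summary B
}
prior_entropy = 2.0  # differential entropy of the (known) parameter prior, in nats

for name, s in summaries.items():
    model = train(GaussianPosterior(s.shape[-1], theta.shape[-1]), theta, s)
    with torch.no_grad():
        mi = ba_lower_bound(model, theta, s, prior_entropy)
    print(f"{name}: BA lower bound on MI = {mi.item():.3f} nats")
# The summary with the larger bound retains more information about the parameters.
```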

Other MI estimators

Additionally, we have experimented with other mutual information estimators from papers such as On Variational Bounds of Mutual Information, Understanding the Limitations of Variational Mutual Information Estimators, and Estimating Mutual Information.

The other_MI_estimators folder currently contains the following notebooks, each utilizing a different method to evaluate the MI in the second experiment mentioned above:

  1. MI_Estimation_Smile.ipynb: Here, we conduct experiments with the SMILE estimator, a variance-reduced version of MINE that employs the Donsker-Varadhan (DV) lower bound (a minimal sketch follows this list).
  2. MI_Estimation_KSG.ipynb: In this notebook, we explore the KSG estimator, a k-nearest-neighbor-based MI estimation method. Since KSG is applicable only to low-dimensional data, we combine it with a further compression step for the summaries (a compact sketch follows this list).
  3. MGC_CMB.ipynb and MGC.ipynb: In these two notebooks, we explore the use of Multiscale Graph Correlation (MGC) to evaluate the dependence between different summaries and parameters (a brief usage sketch follows this list).
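The SMILE estimator clips the exponentiated critic inside the DV bound to reduce variance. The sketch below is a minimal illustration of that idea; the critic architecture and clipping threshold tau are illustrative choices, not necessarily those used in MI_Estimation_Smile.ipynb.

```python
# SMILE: a clipped Donsker-Varadhan bound,
#   E_joint[T] - log E_marginal[clip(exp(T), e^-tau, e^tau)].
import math
import torch
import torch.nn as nn

class Critic(nn.Module):
    """Scalar critic T(s, theta) on concatenated summaries and parameters."""
    def __init__(self, s_dim, theta_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(s_dim + theta_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, s, theta):
        return self.net(torch.cat([s, theta], dim=-1)).squeeze(-1)

def smile_bound(critic, s, theta, tau=5.0):
    joint = critic(s, theta)                                      # paired (joint) samples
    marginal = critic(s, theta[torch.randperm(theta.shape[0])])   # shuffled (marginal) pairs
    clipped = torch.clamp(marginal.exp(), math.exp(-tau), math.exp(tau))
    return joint.mean() - torch.log(clipped.mean())

# Training maximizes the bound, e.g. loss = -smile_bound(critic, s, theta).
```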
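For KSG, the following is a compact sketch of the Kraskov-Stögbauer-Grassberger estimator (algorithm 1) using SciPy; the compression step applied to the summaries in MI_Estimation_KSG.ipynb is not shown here, and x, y are assumed to be low-dimensional sample arrays.

```python
# KSG estimator (Kraskov et al. 2004, algorithm 1):
#   I = psi(k) + psi(N) - < psi(n_x + 1) + psi(n_y + 1) >
import numpy as np
from scipy.spatial import cKDTree
from scipy.special import digamma

def ksg_mi(x, y, k=3):
    """KSG MI estimate for samples x of shape (n, d_x) and y of shape (n, d_y)."""
    n = x.shape[0]
    joint = np.hstack([x, y])
    # Chebyshev distance to the k-th nearest neighbour in the joint space
    # (k + 1 because the query point itself is returned as its own nearest neighbour).
    eps = cKDTree(joint).query(joint, k=k + 1, p=np.inf)[0][:, -1]
    # Neighbours strictly closer than eps in each marginal space, excluding the point itself.
    nx = cKDTree(x).query_ball_point(x, eps - 1e-12, p=np.inf, return_length=True) - 1
    ny = cKDTree(y).query_ball_point(y, eps - 1e-12, p=np.inf, return_length=True) - 1
    return digamma(k) + digamma(n) - np.mean(digamma(nx + 1) + digamma(ny + 1))
```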
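For MGC, a brief usage sketch is given below, assuming the hyppo package as the implementation (an assumption; MGC.ipynb and MGC_CMB.ipynb may call MGC differently), with random placeholder data in place of real summaries and parameters.

```python
# Multiscale Graph Correlation (MGC) via the hyppo package (assumed dependency).
import numpy as np
from hyppo.independence import MGC

rng = np.random.default_rng(0)
summaries = rng.normal(size=(100, 10))  # placeholder summary statistics
params = rng.normal(size=(100, 2))      # placeholder physical parameters

# Permutation test returning the MGC statistic and a p-value for independence.
stat, pvalue, _ = MGC().test(summaries, params, reps=100)
print(f"MGC statistic = {stat:.3f}, p-value = {pvalue:.3f}")
```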

In all of our experiments, we have consistently observed that training a well-performing model is crucial for obtaining reliable results. Enhancing the training procedure with more sophisticated techniques can potentially improve the results obtained using all of these methods. If you have any ideas, comments or questions, please feel free to reach out to Ce Sui at suic20@mails.tsinghua.edu.cn.
