From 861c29e7e942e89f5a655488dfae901ce3344782 Mon Sep 17 00:00:00 2001
From: Haoyue Dai <hyda@cmu.edu>
Date: Sat, 3 Sep 2022 21:28:19 -0400
Subject: [PATCH 1/3] Updated the usage of CIT calling

---
 .../source/independence_tests_index/chisq.rst | 21 ++++++++++--
 .../independence_tests_index/fisherz.rst      | 22 +++++++++++--
 docs/source/independence_tests_index/gsq.rst  | 22 ++++++++++---
 docs/source/independence_tests_index/kci.rst  | 32 ++++++++++++++++---
 .../independence_tests_index/mvfisherz.rst    | 22 ++++++++++---
 5 files changed, 101 insertions(+), 18 deletions(-)

diff --git a/docs/source/independence_tests_index/chisq.rst b/docs/source/independence_tests_index/chisq.rst
index 29d0f10c..a83e4db4 100644
--- a/docs/source/independence_tests_index/chisq.rst
+++ b/docs/source/independence_tests_index/chisq.rst
@@ -5,14 +5,29 @@ Chi-Square test
 
 Perform an independence test on discrete variables using Chi-Square test.
 
-(We have updated the independence test class and the usage example hasn't been updated yet. For new class, please refer to `TestCIT.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT.py>`_ or `TestCIT_KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT_KCI.py>`_.)
-
 Usage
 --------
 .. code-block:: python
 
+    from causallearn.utils.cit import CIT
+    chisq_obj = CIT(data, "chisq") # construct a CIT instance with data and method name
+    pValue = chisq_obj(X, Y, S)
+
+Please be kindly informed that we have refactored the independence tests from functions to classes since the release `v0.1.2.8 <https://github.com/cmu-phil/causal-learn/releases/tag/0.1.2.8>`_. Speed gain and a more flexible parameters specification are enabled.
+
+For users, you may need to adjust your codes accordingly. Specifically, if you are
+
++ running a constraint-based algorithm from end to end: then you don't need to change anything. Old codes are still compatible. For example,
+.. code-block:: python
+
+    from causallearn.search.ConstraintBased.PC import pc
     from causallearn.utils.cit import chisq
-    p = chisq(data, X, Y, conditioning_set)
+    cg = pc(data, 0.05, chisq)
+
++ explicitly calculating the p-value of a test: then you need to declare the :code:`chisq_obj` and then call it as above, instead of using :code:`chisq(data, X, Y, condition_set)` as before. Note that now :code:`causallearn.utils.cit.chisq` is a string :code:`"chisq"`, instead of a function.
+
+Please see `CIT.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/cit.py>`_
+for more details on the implementation of the (conditional) independent tests.
 
 
 Parameters
diff --git a/docs/source/independence_tests_index/fisherz.rst b/docs/source/independence_tests_index/fisherz.rst
index 9a99144a..df993584 100644
--- a/docs/source/independence_tests_index/fisherz.rst
+++ b/docs/source/independence_tests_index/fisherz.rst
@@ -5,15 +5,31 @@ Fisher-z test
 
 Perform an independence test using Fisher-z's test [1]_. This test is optimal for linear-Gaussian data.
 
-(We have updated the independence test class and the usage example hasn't been updated yet. For new class, please refer to `TestCIT.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT.py>`_ or `TestCIT_KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT_KCI.py>`_.)
-
 
 Usage
 --------
 .. code-block:: python
 
+    from causallearn.utils.cit import CIT
+    fisherz_obj = CIT(data, "fisherz") # construct a CIT instance with data and method name
+    pValue = fisherz_obj(X, Y, S)
+
+Please be kindly informed that we have refactored the independence tests from functions to classes since the release `v0.1.2.8 <https://github.com/cmu-phil/causal-learn/releases/tag/0.1.2.8>`_. Speed gain and a more flexible parameters specification are enabled.
+
+For users, you may need to adjust your codes accordingly. Specifically,
+
++ If you are running a constraint-based algorithm from end to end: then you don't need to change anything. Old codes are still compatible. For example,
+.. code-block:: python
+
+    from causallearn.search.ConstraintBased.PC import pc
     from causallearn.utils.cit import fisherz
-    p = fisherz(data, X, Y, condition_set, correlation_matrix)
+    cg = pc(data, 0.05, fisherz)
+
++ If you are explicitly calculating the p-value of a test: then you need to declare the :code:`fisherz_obj` and then call it as above, instead of using :code:`fisherz(data, X, Y, condition_set)` as before. Note that now :code:`causallearn.utils.cit.fisherz` is a string :code:`"fisherz"`, instead of a function.
+
+
+Please see `CIT.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/cit.py>`_
+for more details on the implementation of the (conditional) independent tests.
 
 Parameters
 ------------
diff --git a/docs/source/independence_tests_index/gsq.rst b/docs/source/independence_tests_index/gsq.rst
index 0124d121..1e412541 100644
--- a/docs/source/independence_tests_index/gsq.rst
+++ b/docs/source/independence_tests_index/gsq.rst
@@ -5,15 +5,29 @@ G-Square test
 
 Perform an independence test using G-Square test [1]_. This test is based on the log likelihood ratio test.
 
-(We have updated the independence test class and the usage example hasn't been updated yet. For new class, please refer to `TestCIT.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT.py>`_ or `TestCIT_KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT_KCI.py>`_.)
-
-
 Usage
 --------
 .. code-block:: python
 
+    from causallearn.utils.cit import CIT
+    gsq_obj = CIT(data, "gsq") # construct a CIT instance with data and method name
+    pValue = gsq_obj(X, Y, S)
+
+Please be kindly informed that we have refactored the independence tests from functions to classes since the release `v0.1.2.8 <https://github.com/cmu-phil/causal-learn/releases/tag/0.1.2.8>`_. Speed gain and a more flexible parameters specification are enabled.
+
+For users, you may need to adjust your codes accordingly. Specifically, if you are
+
++ running a constraint-based algorithm from end to end: then you don't need to change anything. Old codes are still compatible. For example,
+.. code-block:: python
+
+    from causallearn.search.ConstraintBased.PC import pc
     from causallearn.utils.cit import gsq
-    p = gsq(data, X, Y, conditioning_set)
+    cg = pc(data, 0.05, gsq)
+
++ explicitly calculating the p-value of a test: then you need to declare the :code:`gsq_obj` and then call it as above, instead of using :code:`gsq(data, X, Y, condition_set)` as before. Note that now :code:`causallearn.utils.cit.gsq` is a string :code:`"gsq"`, instead of a function.
+
+Please see `CIT.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/cit.py>`_
+for more details on the implementation of the (conditional) independent tests.
 
 Parameters
 -------------
diff --git a/docs/source/independence_tests_index/kci.rst b/docs/source/independence_tests_index/kci.rst
index 66fddc80..6f5d861e 100644
--- a/docs/source/independence_tests_index/kci.rst
+++ b/docs/source/independence_tests_index/kci.rst
@@ -7,15 +7,39 @@ Kernel-based conditional independence (KCI) test and independence test [1]_.
 To test if x and y are conditionally or unconditionally independent on Z. For unconditional independence tests,
 Z is set to the empty set.
 
-(We have updated the independence test class and the usage example hasn't been updated yet. For new class, please refer to `TestCIT.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT.py>`_ or `TestCIT_KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT_KCI.py>`_.)
-
-
 Usage
 --------
 .. code-block:: python
 
+    from causallearn.utils.cit import CIT
+    kci_obj = CIT(data, "kci") # construct a CIT instance with data and method name
+    pValue = kci_obj(X, Y, S)
+
+The above code runs KCI with the default parameters. Or instead if you would like to specify some parameters of KCI, you may do it by e.g.,
+
+.. code-block:: python
+
+    kci_obj = CIT(data, "kci", kernelZ='Polynomial', approx=False, est_width='median', ...)
+
+See `KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/KCI/KCI.py>`_
+for more details on the parameters options of the KCI tests.
+
+
+Please be kindly informed that we have refactored the independence tests from functions to classes since the release `v0.1.2.8 <https://github.com/cmu-phil/causal-learn/releases/tag/0.1.2.8>`_. Speed gain and a more flexible parameters specification are enabled.
+
+For users, you may need to adjust your codes accordingly. Specifically, if you are
+
++ running a constraint-based algorithm from end to end: then you don't need to change anything. Old codes are still compatible. For example,
+.. code-block:: python
+
+    from causallearn.search.ConstraintBased.PC import pc
     from causallearn.utils.cit import kci
-    p = kci(data, X, Y, condition_set, kernelX, kernelY, kernelZ, est_width, polyd, kwidthx, kwidthy, kwidthz)
+    cg = pc(data, 0.05, kci)
+
++ explicitly calculating the p-value of a test: then you need to declare the :code:`kci_obj` and then call it as above, instead of using :code:`kci(data, X, Y, condition_set)` as before. Note that now :code:`causallearn.utils.cit.kci` is a string :code:`"kci"`, instead of a function.
+
+Please see `CIT.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/cit.py>`_
+for more details on the implementation of the (conditional) independent tests.
 
 Parameters
 -------------
diff --git a/docs/source/independence_tests_index/mvfisherz.rst b/docs/source/independence_tests_index/mvfisherz.rst
index e0773afe..5854d807 100644
--- a/docs/source/independence_tests_index/mvfisherz.rst
+++ b/docs/source/independence_tests_index/mvfisherz.rst
@@ -6,15 +6,29 @@ Missing-value Fisher-z test
 Perform a testwise-deletion Fisher-z independence test to data sets with missing values.
 With testwise-deletion, the test makes use of all data points that do not have missing values for the variables involved in the test.
 
-(We have updated the independence test class and the usage example hasn't been updated yet. For new class, please refer to `TestCIT.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT.py>`_ or `TestCIT_KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestCIT_KCI.py>`_.)
-
-
 Usage
 --------
 .. code-block:: python
 
+    from causallearn.utils.cit import CIT
+    mv_fisherz_obj = CIT(data_with_missingness, "mv_fisherz") # construct a CIT instance with data and method name
+    pValue = mv_fisherz_obj(X, Y, S)
+
+Please be kindly informed that we have refactored the independence tests from functions to classes since the release `v0.1.2.8 <https://github.com/cmu-phil/causal-learn/releases/tag/0.1.2.8>`_. Speed gain and a more flexible parameters specification are enabled.
+
+For users, you may need to adjust your codes accordingly. Specifically, if you are
+
++ running a constraint-based algorithm from end to end: then you don't need to change anything. Old codes are still compatible. For example,
+.. code-block:: python
+
+    from causallearn.search.ConstraintBased.PC import pc
     from causallearn.utils.cit import mv_fisherz
-    p = mv_fisherz(mvdata, X, Y, condition_set)
+    cg = pc(data_with_missingness, 0.05, mv_fisherz)
+
++ explicitly calculating the p-value of a test: then you need to declare the :code:`mv_fisherz_obj` and then call it as above, instead of using :code:`mv_fisherz(data, X, Y, condition_set)` as before. Note that now :code:`causallearn.utils.cit.mv_fisherz` is a string :code:`"mv_fisherz"`, instead of a function.
+
+Please see `CIT.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/cit.py>`_
+for more details on the implementation of the (conditional) independent tests.
 
 
 Parameters

From 97a10911e98d0374ee113c9b46314dc0b39a9bac Mon Sep 17 00:00:00 2001
From: Haoyue Dai <hyda@cmu.edu>
Date: Sat, 3 Sep 2022 21:28:36 -0400
Subject: [PATCH 2/3] Updated some advanced usages for PC

---
 .../PC.rst                                    | 29 ++++++++++++++++++-
 1 file changed, 28 insertions(+), 1 deletion(-)

diff --git a/docs/source/search_methods_index/Constraint-based causal discovery methods/PC.rst b/docs/source/search_methods_index/Constraint-based causal discovery methods/PC.rst
index 23eee23a..80b5bdcf 100644
--- a/docs/source/search_methods_index/Constraint-based causal discovery methods/PC.rst	
+++ b/docs/source/search_methods_index/Constraint-based causal discovery methods/PC.rst	
@@ -35,6 +35,33 @@ Usage
 
 Visualization using pydot is recommended. If specific label names are needed, please refer to this `usage example <https://github.com/cmu-phil/causal-learn/blob/main/tests/TestGraphVisualization.py>`_ (e.g., 'cg.draw_pydot_graph(labels=["A", "B", "C"])' or 'GraphUtils.to_pydot(cg.G, labels=["A", "B", "C"])').
 
++++++++++++++++
+Advanced Usages
++++++++++++++++
++ If you would like to specify parameters for the (conditional) independence test (if available), you may directly pass the parameters to the :code:`pc` call. E.g.,
+
+  .. code-block:: python
+
+    from causallearn.search.ConstraintBased.PC import pc
+    from causallearn.utils.cit import kci
+    cg = pc(data, 0.05, kci, kernelZ='Polynomial', approx=False, est_width='median', ...)
+
++ If your graph is big and/or your independence test is slow (e.g., KCI), you may want to cache the p-value results to a local checkpoint. Then by reading values from this local checkpoint, no more repeated calculation will be wasted to resume from checkpoint / just finetune some PC parameters. This can be achieved by specifying :code:`cache_path`. E.g.,
+
+  .. code-block:: python
+
+        citest_cache_file = "/my/path/to/citest_cache_dataname_kci.json"    # .json file
+        cg1 = pc(data, 0.05, kci, cache_path=citest_cache_file)             # after the long run
+
+        # just finetune uc_rule. p-values are reused, and thus cg2 is done in almost no time.
+        cg2 = pc(data, 0.05, kci, cache_path=citest_cache_file, uc_rule=1)
+  ..
+
+  If :code:`cache_path` does not exist in your local file system, a new one will be created. Otherwise, the cache will be first loaded from the json file to the CIT class and used during the runtime. Note that 1) data hash and parameters hash will first be checked at loading to ensure consistency, and 2) during runtime, the cache will be saved to the local file every 30 seconds.
+
++ The above advanced usages also apply to other constraint-based methods, e.g., FCI and CDNOD.
+
+
 Parameters
 -------------------
 **data**: numpy.ndarray, shape (n_samples, n_features). Data, where n_samples is the number of samples
@@ -42,7 +69,7 @@ and n_features is the number of features.
 
 **alpha**: desired significance level (float) in (0, 1). Default: 0.05.
 
-**indep_test**: Independence test method function. Default: 'fisherz'.
+**indep_test**: string, name of the independence test method. Default: 'fisherz'.
        - ":ref:`fisherz <Fisher-z test>`": Fisher's Z conditional independence test.
        - ":ref:`chisq <Chi-Square test>`": Chi-squared conditional independence test.
        - ":ref:`gsq <G-Square test>`": G-squared conditional independence test.

From e531b5afb8707dbe010ece8e713f03b74e2f3031 Mon Sep 17 00:00:00 2001
From: Haoyue Dai <hyda@cmu.edu>
Date: Sat, 3 Sep 2022 21:58:32 -0400
Subject: [PATCH 3/3] Updated the parameters for CITs docs

---
 .../source/independence_tests_index/chisq.rst |  5 ++---
 .../independence_tests_index/fisherz.rst      |  4 ++--
 docs/source/independence_tests_index/gsq.rst  |  4 ++--
 docs/source/independence_tests_index/kci.rst  | 22 +++++++++++--------
 .../independence_tests_index/mvfisherz.rst    |  8 ++++---
 5 files changed, 24 insertions(+), 19 deletions(-)

diff --git a/docs/source/independence_tests_index/chisq.rst b/docs/source/independence_tests_index/chisq.rst
index a83e4db4..beb41b33 100644
--- a/docs/source/independence_tests_index/chisq.rst
+++ b/docs/source/independence_tests_index/chisq.rst
@@ -35,10 +35,9 @@ Parameters
 **data**: numpy.ndarray, shape (n_samples, n_features). Data, where n_samples is the number of samples
 and n_features is the number of features.
 
-**X, Y and condition_set**: column indices of data.
+**method**: string, "chisq".
 
-**G_sq**: True means using G-Square test;
-       False means using Chi-Square test.
+**kwargs**: e.g., :code:`cache_path`. See :ref:`Advanced Usages <Advanced Usages>`.
 
 Returns
 -------------
diff --git a/docs/source/independence_tests_index/fisherz.rst b/docs/source/independence_tests_index/fisherz.rst
index df993584..cb8e0072 100644
--- a/docs/source/independence_tests_index/fisherz.rst
+++ b/docs/source/independence_tests_index/fisherz.rst
@@ -36,9 +36,9 @@ Parameters
 **data**: numpy.ndarray, shape (n_samples, n_features). Data, where n_samples is the number of samples
 and n_features is the number of features.
 
-**X, Y and condition_set**: column indices of data.
+**method**: string, "fisherz".
 
-**correlation_matrix**: correlation matrix; None means without the parameter of correlation matrix.
+**kwargs**: e.g., :code:`cache_path`. See :ref:`Advanced Usages <Advanced Usages>`.
 
 Returns
 -------------
diff --git a/docs/source/independence_tests_index/gsq.rst b/docs/source/independence_tests_index/gsq.rst
index 1e412541..9a3bd402 100644
--- a/docs/source/independence_tests_index/gsq.rst
+++ b/docs/source/independence_tests_index/gsq.rst
@@ -34,9 +34,9 @@ Parameters
 **data**: numpy.ndarray, shape (n_samples, n_features). Data, where n_samples is the number of samples
 and n_features is the number of features.
 
-**X, Y and condition_set**: column indices of data.
+**method**: string, "gsq".
 
-**G_sq**: True means using G-Square test; False means using Chi-Square test.
+**kwargs**: e.g., :code:`cache_path`. See :ref:`Advanced Usages <Advanced Usages>`.
 
 Returns
 ---------------
diff --git a/docs/source/independence_tests_index/kci.rst b/docs/source/independence_tests_index/kci.rst
index 6f5d861e..f3f73513 100644
--- a/docs/source/independence_tests_index/kci.rst
+++ b/docs/source/independence_tests_index/kci.rst
@@ -42,26 +42,30 @@ Please see `CIT.py <https://github.com/cmu-phil/causal-learn/blob/main/causallea
 for more details on the implementation of the (conditional) independent tests.
 
 Parameters
--------------
+------------
 **data**: numpy.ndarray, shape (n_samples, n_features). Data, where n_samples is the number of samples
 and n_features is the number of features.
 
-**X, Y, and condition_set**: column indices of data. condition_set could be None.
+**method**: string, "kci".
 
-**KernelX/Y/Z (condition_set)**: ['GaussianKernel', 'LinearKernel', 'PolynomialKernel'].
-(For 'PolynomialKernel', the default degree is 2. Currently, users can change it by setting the 'degree' of 'class PolynomialKernel()'.
+**kwargs**:
 
-**est_width**: set kernel width for Gaussian kernels.
++ Either for specifying parameters of KCI, including:
+
+  **KernelX/Y/Z (condition_set)**: ['GaussianKernel', 'LinearKernel', 'PolynomialKernel']. (For 'PolynomialKernel', the default degree is 2. Currently, users can change it by setting the 'degree' of 'class PolynomialKernel()'.
+
+  **est_width**: set kernel width for Gaussian kernels.
    - 'empirical': set kernel width using empirical rules (default).
    - 'median': set kernel width using the median trick.
 
-**polyd**: polynomial kernel degrees (default=2).
+  **polyd**: polynomial kernel degrees (default=2).
+
+  **kwidthx/y/z**: kernel width for data x/y/z (standard deviation sigma).
 
-**kwidthx**: kernel width for data x (standard deviation sigma).
+  **and more**: aee `KCI.py <https://github.com/cmu-phil/causal-learn/blob/main/causallearn/utils/KCI/KCI.py>`_ for details.
 
-**kwidthy**: kernel width for data y (standard deviation sigma).
++ Or for advanced usages of CIT, e.g., :code:`cache_path`. See :ref:`Advanced Usages <Advanced Usages>`.
 
-**kwidthz**: kernel width for data z (standard deviation sigma).
 
 Returns
 -----------
diff --git a/docs/source/independence_tests_index/mvfisherz.rst b/docs/source/independence_tests_index/mvfisherz.rst
index 5854d807..5dd64ca6 100644
--- a/docs/source/independence_tests_index/mvfisherz.rst
+++ b/docs/source/independence_tests_index/mvfisherz.rst
@@ -32,11 +32,13 @@ for more details on the implementation of the (conditional) independent tests.
 
 
 Parameters
----------------
-**mvdata**: numpy.ndarray, shape (n_samples, n_features). Data with missing value, where n_samples is the number of samples
+------------
+**data**: numpy.ndarray, shape (n_samples, n_features). Data, where n_samples is the number of samples
 and n_features is the number of features.
 
-**X, Y and condition_set**: column indices of data.
+**method**: string, "mv_fisherz".
+
+**kwargs**: e.g., :code:`cache_path`. See :ref:`Advanced Usages <Advanced Usages>`.
 
 Returns
 ----------------