First, import the tools and create the object that we will use to work with papers.

In [1]:
from litrevtools import LitrevTools

In [2]:
lt = LitrevTools()

Next, start with a list of papers. For example, take a few papers that come up on Google and Google Scholar when searching for causal representation learning (CRL), or that you know are popular in this area.

In [3]:
crl_titles_initial = [
    'Towards Causal Representation Learning',
    'Identifiable Causal Representation Learning: Unsupervised, Multi-View, and Multi-Environment',
    'Interventional Causal Representation Learning',
    'Linear Causal Disentanglement via Interventions'
]

Compute the total number of citations of each paper, and its average number of citations per day.

In [4]:
citations_dict = lt.citation_counts(crl_titles_initial)

100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:26<00:00,  6.51s/it]


In [5]:
citations_dict

{'citation counts': {'Towards Causal Representation Learning': 307,
  'Identifiable Causal Representation Learning: Unsupervised, Multi-View, and Multi-Environment': 0,
  'Interventional Causal Representation Learning': 66,
  'Linear Causal Disentanglement via Interventions': 52},
 'daily citation counts': {'Towards Causal Representation Learning': 0.22791388270230142,
  'Identifiable Causal Representation Learning: Unsupervised, Multi-View, and Multi-Environment': 0.0,
  'Interventional Causal Representation Learning': 0.0859375,
  'Linear Causal Disentanglement via Interventions': 0.07407407407407407}}
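
The daily figures appear consistent with dividing each total by the number of days since publication (for instance, 66 / 0.0859375 = 768 days). Below is a minimal sketch of that calculation, assuming this definition; the function name and the dates are illustrative and are not taken from litrevtools.

```python
from datetime import date, timedelta


def daily_citation_rate(total_citations, published, today):
    """Average citations per day since publication (assumed definition)."""
    days = (today - published).days
    if days <= 0:
        # Published today or in the future: no meaningful rate yet.
        return 0.0
    return total_citations / days


# Illustrative dates only: any pair 768 days apart reproduces the value above.
published = date(2022, 1, 1)
rate = daily_citation_rate(66, published, published + timedelta(days=768))
print(rate)  # 0.0859375
```

Under this definition, recently published papers can have a low total but a comparable daily rate, which is why both numbers are reported.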

Now, how do we find all CRL papers, or at least enough of them? One strategy is to find all papers that cite, or are cited by, these initial CRL papers and that mention causal representation learning in their title or abstract. One of the initial papers launched the field, and another is a PhD thesis, so we expect them to have a citation or reference connection with all the relevant papers. As the titles above show, CRL can be referred to as "causal representation learning" but also as "causal disentanglement". Thus, we run the following command, which finds all references and citations of the initial CRL papers that have at least one of "causal representation learning" or "causal disentanglement" in their title or abstract. This method uses the Semantic Scholar API.

In [6]:
crl_titles = lt.bulldozer(crl_titles_initial, keywords = ["causal representation learning", "causal disentanglement"])

100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:15<00:00,  3.96s/it]
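
The keyword step can be pictured as a case-insensitive substring test on the title and abstract of each candidate paper. The sketch below is an assumption about that step, not the litrevtools implementation; `mentions_any` and the toy records are made up for illustration.

```python
def mentions_any(paper, keywords):
    """True if any keyword occurs in the paper's title or abstract."""
    # Missing fields (None) are treated as empty strings.
    text = " ".join([paper.get("title") or "", paper.get("abstract") or ""]).lower()
    return any(keyword.lower() in text for keyword in keywords)


keywords = ["causal representation learning", "causal disentanglement"]

# Toy records standing in for the references and citations of a seed paper.
candidates = [
    {"title": "Paper A", "abstract": "We study causal disentanglement."},
    {"title": "Paper B on Causal Representation Learning", "abstract": None},
    {"title": "Paper C", "abstract": "An unrelated topic."},
]

kept = [paper["title"] for paper in candidates if mentions_any(paper, keywords)]
print(kept)  # ['Paper A', 'Paper B on Causal Representation Learning']
```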


In [7]:
for idx, title in enumerate(crl_titles):
    print(idx+1, ' : ', title)

1  :  continual learning of nonlinear independent representations
2  :  identifiable exchangeable mechanisms for causal structure and representation learning
3  :  causal representation learning from multiple distributions: a general setting
4  :  when graph neural network meets causality: opportunities, methodologies and an outlook
5  :  causal representation learning made identifiable by grouping of observational variables
6  :  generative causal representation learning for out-of-distribution motion forecasting
7  :  causal triplet: an open challenge for intervention-centric causal representation learning
8  :  less data, more knowledge: building next generation semantic communication networks
9  :  domain generalization - a causal perspective
10  :  causal representation learning for instantaneous and temporal effects in interactive systems
11  :  do-operation guided causal representation learning with reduced supervision strength
12  :  causal machine learning for healthcare and p

We also check the CRL papers published on arXiv in September 2024, using the arXiv API.

In [8]:
crl_titles_arxiv = lt.parse_arxiv(start='2024-09-01', end='2024-09-30', keywords = ["causal representation learning", "causal disentanglement"])

In [9]:
for idx, title in enumerate(crl_titles_arxiv):
    print(idx+1, ' : ', title)

1  :  Identifying Weight-Variant Latent Causal Models
2  :  Unifying Causal Representation Learning with the Invariance Principle
3  :  Celcomen: spatial causal disentanglement for single-cell and tissue perturbation modeling
4  :  CauSkelNet: Causal Representation Learning for Human Behaviour Analysis
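
One way to check which of these arXiv titles are already covered by an earlier result is to compare titles after normalizing case and punctuation. The helper names below are hypothetical and are not part of litrevtools; the two lists are short excerpts used only to illustrate the comparison.

```python
import re


def normalize(title):
    """Lowercase a title and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", title.lower())


def missing_titles(candidates, known):
    """Return the candidates whose normalized form is absent from `known`."""
    known_keys = {normalize(title) for title in known}
    return [title for title in candidates if normalize(title) not in known_keys]


arxiv_titles = [
    "Unifying Causal Representation Learning with the Invariance Principle",
    "CauSkelNet: Causal Representation Learning for Human Behaviour Analysis",
]
found_titles = ["unifying causal representation learning with the invariance principle"]

print(missing_titles(arxiv_titles, found_titles))
# ['CauSkelNet: Causal Representation Learning for Human Behaviour Analysis']
```

Because spaces and punctuation are dropped, spelling variants such as "object - centric" and "Object-centric" compare equal. Titles that carry extra text, such as a trailing URL, would still need a fuzzier comparison.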


The Celcomen and CauSkelNet papers from this arXiv list are not in the list returned by the first bulldozer run. This might be fixed by running another round of the bulldozer method on the papers found so far, which should pick up papers that the first round missed. Through the queue parameter, we also specify that the initial CRL papers, in addition to the input papers, should not be returned by this new search.

In [10]:
crl_titles_bis = lt.bulldozer(crl_titles, queue = crl_titles_initial + crl_titles, keywords = ["causal representation learning", "causal disentanglement"])

 59%|██████████████████████████████████████████████████████████████████▎                                             | 32/54 [01:54<01:13,  3.34s/it]

No result ! <-  invariance & causal representation learning: prospects and limitations


100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 54/54 [02:56<00:00,  3.27s/it]


In [11]:
for idx, title in enumerate(crl_titles_bis):
    print(idx+1, ' : ', title)

1  :  uni-fying causal representation learning with the invariance principle, september 2024. url http://arxiv.org/abs/2409.02772
2  :  on the identification of temporally causal representation with instantaneous dependence
3  :  a twist for graph classification: optimizing causal information flow in graph neural networks
4  :  weakly supervised disentangled generative causal representation learning
5  :  invariant causal representation learning for out-of-distribution generalization
6  :  identifying weight-variant latent causal models
7  :  causal reasoning: charting a revolutionary course for next-generation ai-native wireless networks
8  :  causal disentanglement with network information for debiased recommendations
9  :  learning causal representation for training cross-domain pose estimator via generative interventions
10  :  disentangled generative causal representation learning
11  :  towards the reusability and compositionality of causal representations
12  :  biscuit: causal 

Thus, the Celcomen paper has now been found by the bulldozer method. The CauSkelNet paper is still not included, but a quick look at its Semantic Scholar page shows that it does not have any reference or citation with the keywords "causal representation learning" or "causal disentanglement". This shows that the bulldozer method should be complemented with more "naive" ways of finding papers, e.g. a simple search on Google Scholar or Semantic Scholar.

Next, say that we actually only care about CRL methods that are score-based and have identifiability results. The filter method further narrows the CRL papers found so far to those that have both "score" and "identif" (which covers words such as "identify", "identifiability", or "identifiable") in their title or abstract. We also manually review and select every filtered paper; notably, this requires the user's intervention. In the keywords argument, tuples encode "and" and lists encode "or". The argument can also be nested, e.g. `['x', ('y', 'z')]` means "x or (y and z)". Unlike the bulldozer method, which works from references and citations, here abstracts are looked up on arXiv first, then on Semantic Scholar if not found on arXiv.
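
The and/or convention for keywords can be sketched as a small recursive matcher. This only illustrates the convention described above, assuming case-insensitive substring matching; `matches` is a hypothetical helper, not the litrevtools implementation.

```python
def matches(text, keywords):
    """Evaluate a nested keyword spec against a text.

    A string is a substring test, a tuple requires all of its items ("and"),
    and a list requires at least one of its items ("or").
    """
    if isinstance(keywords, str):
        return keywords.lower() in text.lower()
    if isinstance(keywords, tuple):
        return all(matches(text, item) for item in keywords)
    if isinstance(keywords, list):
        return any(matches(text, item) for item in keywords)
    raise TypeError(f"Unsupported keyword spec: {keywords!r}")


title = "Score-based Causal Representation Learning with Interventions: Identifiability"
print(matches(title, ("score", "identif")))  # True: both substrings occur
print(matches("identifiable latent neural causal models", ("score", "identif")))  # False
print(matches("only y", ["x", ("y", "z")]))  # False: no x, and z is missing
```

Using the stem "identif" rather than a full word keeps the filter tolerant to inflections, at the cost of occasionally matching unrelated words that share the stem.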

In [12]:
crl_titles_all = crl_titles_initial + crl_titles + crl_titles_bis

In [13]:
crl_titles_filtered = lt.filter(crl_titles_all, keywords = ("score", "identif"))

Getting abstracts


  0%|                                                                                                                        | 0/114 [00:00<?, ?it/s]

Trying title 'Towards Causal Representation Learning' with 'arxiv'


  1%|▉                                                                                                               | 1/114 [00:02<04:08,  2.20s/it]

Found!
Trying title 'Identifiable Causal Representation Learning: Unsupervised, Multi-View, and Multi-Environment' with 'arxiv'


  2%|█▉                                                                                                              | 2/114 [00:04<04:03,  2.17s/it]

Found!
Trying title 'Interventional Causal Representation Learning' with 'arxiv'


  3%|██▉                                                                                                             | 3/114 [00:06<03:42,  2.00s/it]

Found!
Trying title 'Linear Causal Disentanglement via Interventions' with 'arxiv'


  4%|███▉                                                                                                            | 4/114 [00:06<02:40,  1.46s/it]

Found!
Trying title 'continual learning of nonlinear independent representations' with 'arxiv'


  4%|████▉                                                                                                           | 5/114 [00:08<02:38,  1.45s/it]

Found!
Trying title 'identifiable exchangeable mechanisms for causal structure and representation learning' with 'arxiv'


  5%|█████▉                                                                                                          | 6/114 [00:09<02:21,  1.31s/it]

Found!
Trying title 'causal representation learning from multiple distributions: a general setting' with 'arxiv'


  6%|██████▉                                                                                                         | 7/114 [00:10<02:03,  1.16s/it]

Found!
Trying title 'when graph neural network meets causality: opportunities, methodologies and an outlook' with 'arxiv'


  7%|███████▊                                                                                                        | 8/114 [00:11<01:59,  1.12s/it]

Found!
Trying title 'causal representation learning made identifiable by grouping of observational variables' with 'arxiv'


  8%|████████▊                                                                                                       | 9/114 [00:11<01:48,  1.04s/it]

Found!
Trying title 'generative causal representation learning for out-of-distribution motion forecasting' with 'arxiv'


  9%|█████████▋                                                                                                     | 10/114 [00:14<02:30,  1.44s/it]

Found!
Trying title 'causal triplet: an open challenge for intervention-centric causal representation learning' with 'arxiv'


 10%|██████████▋                                                                                                    | 11/114 [00:16<02:50,  1.66s/it]

Found!
Trying title 'less data, more knowledge: building next generation semantic communication networks' with 'arxiv'


 11%|███████████▋                                                                                                   | 12/114 [00:17<02:16,  1.34s/it]

Found!
Trying title 'domain generalization - a causal perspective' with 'arxiv'


 11%|████████████▋                                                                                                  | 13/114 [00:19<02:43,  1.62s/it]

Found!
Trying title 'causal representation learning for instantaneous and temporal effects in interactive systems' with 'arxiv'


 12%|█████████████▋                                                                                                 | 14/114 [00:19<02:11,  1.32s/it]

Found!
Trying title 'do-operation guided causal representation learning with reduced supervision strength' with 'arxiv'


 13%|██████████████▌                                                                                                | 15/114 [00:22<02:32,  1.54s/it]

Found!
Trying title 'causal machine learning for healthcare and precision medicine' with 'arxiv'


 14%|███████████████▌                                                                                               | 16/114 [00:22<02:09,  1.32s/it]

Found!
Trying title 'weakly supervised causal representation learning' with 'arxiv'


 15%|████████████████▌                                                                                              | 17/114 [00:23<01:48,  1.12s/it]

Found!
Trying title 'multi-instance causal representation learning for instance label prediction and out-of-distribution generalization' with 'arxiv'


 16%|█████████████████▌                                                                                             | 18/114 [00:25<02:10,  1.36s/it]

Found!
Trying title 'generative multitask learning mitigates target-causing confounding' with 'arxiv'


 17%|██████████████████▌                                                                                            | 19/114 [00:27<02:33,  1.61s/it]

Found!
Trying title 'citris: causal identifiability from temporal intervened sequences' with 'arxiv'


 18%|███████████████████▍                                                                                           | 20/114 [00:28<02:02,  1.30s/it]

Found!
Trying title 'toward causal representation learning' with 'arxiv'


 18%|████████████████████▍                                                                                          | 21/114 [00:29<02:08,  1.38s/it]

Found!
Trying title 'connectivity-contrastive learning: combining causal discovery and representation learning for multimodal data' with 'arxiv'
Titles do not correspond :
connectivity-contrastive learning: combining causal discovery and representation learning for multimodal data
Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning
Got None when trying title 'connectivity-contrastive learning: combining causal discovery and representation learning for multimodal data' with 'arxiv'
Trying title 'connectivity-contrastive learning: combining causal discovery and representation learning for multimodal data' with 'semanticscholar'


 19%|█████████████████████▍                                                                                         | 22/114 [00:32<02:50,  1.85s/it]

Found!
Trying title 'survey on trustworthy graph neural networks: from a causal perspective' with 'arxiv'


 20%|██████████████████████▍                                                                                        | 23/114 [00:34<02:55,  1.93s/it]

Found!
Trying title 'edinburgh research explorer causal machine learning for healthcare and precision medicine' with 'arxiv'
Titles do not correspond :
edinburgh research explorer causal machine learning for healthcare and precision medicine
Causal Machine Learning for Healthcare and Precision Medicine
Got None when trying title 'edinburgh research explorer causal machine learning for healthcare and precision medicine' with 'arxiv'
Trying title 'edinburgh research explorer causal machine learning for healthcare and precision medicine' with 'semanticscholar'


 21%|███████████████████████▎                                                                                       | 24/114 [00:37<03:11,  2.13s/it]

Found!
Weird paperdict, some vital fields (author, year and/or title) are missing. Imputing what's missing with NA. Please check the original paperdict:  {'title': 'Edinburgh Research Explorer Causal Machine Learning for Healthcare and Precision Medicine', 'author': "Pedro Sanchez and J. Voisey and Tian Xia and Hannah I. Watson and Alison Q. O'Neil and S. Tsaftaris", 'ENTRYTYPE': 'misc', 'ID': 'None', 'abstract': 'Causal machine learning (CML) has experienced increasing popularity in health-care. Beyond the inherent capabilities of adding domain knowledge into learning systems, CML provides a complete toolset for investigating how a system would react to an intervention (e.g. outcome given a treatment). Quantifying effects of interventions allows actionable decisions to be made whilst maintaining robustness in the presence of confounders. Here, we explore how causal inference can be incorporated into different aspects of clinical decision support (CDS) systems by using recent advances 

 22%|████████████████████████▎                                                                                      | 25/114 [00:38<02:34,  1.74s/it]

Found!
Trying title 'unifying causal representation learning with the invariance principle' with 'arxiv'


 23%|█████████████████████████▎                                                                                     | 26/114 [00:39<02:11,  1.50s/it]

Found!
Trying title 'causal representation learning for gan-generated face image quality assessment' with 'arxiv'
Titles do not correspond :
causal representation learning for gan-generated face image quality assessment
Image Content Generation with Causal Reasoning
Got None when trying title 'causal representation learning for gan-generated face image quality assessment' with 'arxiv'
Trying title 'causal representation learning for gan-generated face image quality assessment' with 'semanticscholar'


 24%|██████████████████████████▎                                                                                    | 27/114 [00:51<07:05,  4.89s/it]

Found!
Trying title 'linear causal disentanglement via higher-order cumulants' with 'arxiv'


 25%|███████████████████████████▎                                                                                   | 28/114 [00:53<05:43,  4.00s/it]

Found!
Trying title 'diffusion-based causal representation learning' with 'arxiv'


 25%|████████████████████████████▏                                                                                  | 29/114 [00:55<04:46,  3.37s/it]

Found!
Trying title 'linear causal representation learning from unknown multi-node interventions' with 'arxiv'
Error in Google Search without proxy - trying with proxy
Google search crashed
Got None when trying title 'linear causal representation learning from unknown multi-node interventions' with 'arxiv'
Trying title 'linear causal representation learning from unknown multi-node interventions' with 'semanticscholar'


Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 357, in search_google
    return next(url for url in google_search_module(query))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
StopIteration

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 379, in search_arxiv
    id_list=[self.search_google(title + ' site:arxiv.org').split('/')[-1]],
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 364, in search_google
    raise "No proxy available! Error!"
TypeError: exceptions must derive from BaseException
 26%|█████████████████████████████▏                                                                                 | 30/114 [00:57<03:52,  2.77s/it]

Found!
Trying title 'marrying causal representation learning with dynamical systems for science' with 'arxiv'


 27%|██████████████████████████████▏                                                                                | 31/114 [00:57<02:57,  2.14s/it]

Found!
Trying title 'identifiable latent neural causal models' with 'arxiv'


 28%|███████████████████████████████▏                                                                               | 32/114 [00:58<02:28,  1.81s/it]

Found!
Trying title 'a sparsity principle for partially observable causal representation learning' with 'arxiv'


 29%|████████████████████████████████▏                                                                              | 33/114 [00:59<02:02,  1.52s/it]

Found!
Trying title 'implicit causal representation learning via switchable mechanisms' with 'arxiv'


 30%|█████████████████████████████████                                                                              | 34/114 [01:00<01:39,  1.25s/it]

Found!
Trying title 'learning interpretable concepts: unifying causal representation learning and foundation models' with 'arxiv'


 31%|██████████████████████████████████                                                                             | 35/114 [01:00<01:23,  1.05s/it]

Found!
Trying title 'invariance & causal representation learning: prospects and limitations' with 'arxiv'


 32%|███████████████████████████████████                                                                            | 36/114 [01:01<01:21,  1.04s/it]

Found!
Trying title 'multi-view causal representation learning with partial observability' with 'arxiv'


 32%|████████████████████████████████████                                                                           | 37/114 [01:04<01:46,  1.38s/it]

Found!
Trying title 'identifying linearly-mixed causal representations from multi-node interventions' with 'arxiv'
Error in Google Search without proxy - trying with proxy
Google search crashed
Got None when trying title 'identifying linearly-mixed causal representations from multi-node interventions' with 'arxiv'
Trying title 'identifying linearly-mixed causal representations from multi-node interventions' with 'semanticscholar'


Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 357, in search_google
    return next(url for url in google_search_module(query))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
StopIteration

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 379, in search_arxiv
    id_list=[self.search_google(title + ' site:arxiv.org').split('/')[-1]],
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 364, in search_google
    raise "No proxy available! Error!"
TypeError: exceptions must derive from BaseException
 33%|█████████████████████████████████████                                                                          | 38/114 [01:05<01:41,  1.33s/it]

Found!
Trying title 'object-centric architectures enable efficient causal representation learning' with 'arxiv'


 34%|█████████████████████████████████████▉                                                                         | 39/114 [01:07<01:59,  1.60s/it]

Found!
Trying title 'identifiable latent polynomial causal models through the lens of change' with 'arxiv'


 35%|██████████████████████████████████████▉                                                                        | 40/114 [01:08<01:38,  1.32s/it]

Found!
Trying title 'general identifiability and achievability for causal representation learning' with 'arxiv'


 36%|███████████████████████████████████████▉                                                                       | 41/114 [01:11<02:10,  1.79s/it]

Found!
Trying title 'from identifiable causal representations to controllable counterfactual generation: a survey on causal generative modeling' with 'arxiv'


 37%|████████████████████████████████████████▉                                                                      | 42/114 [01:11<01:44,  1.46s/it]

Found!
Trying title 'identifying representations for intervention extrapolation' with 'arxiv'


 38%|█████████████████████████████████████████▊                                                                     | 43/114 [01:12<01:29,  1.26s/it]

Found!
Trying title 'multi-domain causal representation learning via weak distributional invariances' with 'arxiv'


 39%|██████████████████████████████████████████▊                                                                    | 44/114 [01:14<01:50,  1.58s/it]

Found!
Trying title 'identifiability guarantees for causal disentanglement from soft interventions' with 'arxiv'


 39%|███████████████████████████████████████████▊                                                                   | 45/114 [01:15<01:29,  1.30s/it]

Found!
Trying title 'a causal ordering prior for unsupervised representation learning' with 'arxiv'


 40%|████████████████████████████████████████████▊                                                                  | 46/114 [01:17<01:37,  1.43s/it]

Found!
Trying title 'learning causally disentangled representations via the principle of independent causal mechanisms' with 'arxiv'


 41%|█████████████████████████████████████████████▊                                                                 | 47/114 [01:18<01:22,  1.23s/it]

Found!
Trying title 'nonparametric identifiability of causal representations from unknown interventions' with 'arxiv'


 42%|██████████████████████████████████████████████▋                                                                | 48/114 [01:18<01:13,  1.11s/it]

Found!
Trying title 'causal component analysis' with 'arxiv'


 43%|███████████████████████████████████████████████▋                                                               | 49/114 [01:19<01:01,  1.05it/s]

Found!
Trying title 'disentanglement of latent representations via sparse causal interventions' with 'arxiv'
Titles do not correspond :
disentanglement of latent representations via sparse causal interventions
Disentanglement of Latent Representations via Causal Interventions
Got None when trying title 'disentanglement of latent representations via sparse causal interventions' with 'arxiv'
Trying title 'disentanglement of latent representations via sparse causal interventions' with 'semanticscholar'


 44%|████████████████████████████████████████████████▋                                                              | 50/114 [01:21<01:28,  1.38s/it]

Found!
Trying title 'unpaired multi-domain causal representation learning' with 'arxiv'


 45%|█████████████████████████████████████████████████▋                                                             | 51/114 [01:23<01:37,  1.55s/it]

Found!
Trying title 'score-based causal representation learning with interventions' with 'arxiv'


 46%|██████████████████████████████████████████████████▋                                                            | 52/114 [01:25<01:41,  1.63s/it]

Found!
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'arxiv'
Titles do not correspond :
score-based causal representation learning from interventions: nonparametric identifiability
Score-based Causal Representation Learning with Interventions
Got None when trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'arxiv'
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'semanticscholar'


 46%|███████████████████████████████████████████████████▌                                                           | 53/114 [01:38<05:13,  5.14s/it]

Found!
Weird paperdict, some vital fields (author, year and/or title) are missing. Imputing what's missing with NA. Please check the original paperdict:  {'title': 'Score-based Causal Representation Learning from Interventions: Nonparametric Identifiability', 'author': 'Burak Varici and Emre Acartürk and Karthikeyan Shanmugam and A. Tajer', 'ENTRYTYPE': 'misc', 'ID': 'None', 'abstract': 'This paper focuses on causal representation learning (CRL) under a general non-parametric causal latent model and a general transformation model that maps the latent data to the observational data. It establishes identifiability and achievability results using two (stochastic) hard uncoupled interventions per node in the latent causal graph. Notably, one does not know which pair of intervention environments have the same node intervened (hence, uncoupled environments). For identifiability, the paper establishes that perfect recovery of the latent causal model and variables is guaranteed under uncoupled

 47%|████████████████████████████████████████████████████▌                                                          | 54/114 [01:41<04:28,  4.47s/it]

Found!
Trying title 'targeted reduction of causal models' with 'arxiv'


 48%|█████████████████████████████████████████████████████▌                                                         | 55/114 [01:42<03:15,  3.32s/it]

Found!
Trying title 'causal disentanglement of multimodal data' with 'arxiv'


 49%|██████████████████████████████████████████████████████▌                                                        | 56/114 [01:43<02:29,  2.59s/it]

Found!
Trying title 'o bject - centric architectures enable efficient causal representation learning' with 'arxiv'
Titles do not correspond :
o bject - centric architectures enable efficient causal representation learning
Object-centric architectures enable efficient causal representation learning
Got None when trying title 'o bject - centric architectures enable efficient causal representation learning' with 'arxiv'
Trying title 'o bject - centric architectures enable efficient causal representation learning' with 'semanticscholar'


 50%|███████████████████████████████████████████████████████▌                                                       | 57/114 [01:45<02:24,  2.54s/it]

Found!
Weird paperdict, some vital fields (author, year and/or title) are missing. Imputing what's missing with NA. Please check the original paperdict:  {'title': 'O BJECT - CENTRIC ARCHITECTURES ENABLE EFFICIENT CAUSAL REPRESENTATION LEARNING', 'ENTRYTYPE': 'misc', 'ID': 'None', 'abstract': 'only addresses injectivity failures, but also results in a significant reduction in the number of perturbations we need to observe to disentangle properties using Ahuja et al. (2022b)’s approach. We illustrate these results by developing a property disentanglement algorithm that combines Zhang et al. (2023)’s SA-MESH object-centric architecture with Ahuja et al. (2022b)’s approach to disentangle-ment and show that our approach is very effective at disentangling the properties of objects on both 2D and 3D synthetic benchmarks.'}
Trying title 'weight-variant latent causal models' with 'arxiv'
Titles do not correspond :
weight-variant latent causal models
Identifying Weight-Variant Latent Causal Mod

 51%|████████████████████████████████████████████████████████▍                                                      | 58/114 [01:48<02:25,  2.59s/it]

Found!
Trying title 'uni-fying causal representation learning with the invariance principle, september 2024. url http://arxiv.org/abs/2409.02772' with 'arxiv'
Error in Google Search without proxy - trying with proxy
Google search crashed
Got None when trying title 'uni-fying causal representation learning with the invariance principle, september 2024. url http://arxiv.org/abs/2409.02772' with 'arxiv'
Trying title 'uni-fying causal representation learning with the invariance principle, september 2024. url http://arxiv.org/abs/2409.02772' with 'semanticscholar'


Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 357, in search_google
    return next(url for url in google_search_module(query))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
StopIteration

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 379, in search_arxiv
    id_list=[self.search_google(title + ' site:arxiv.org').split('/')[-1]],
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 364, in search_google
    raise "No proxy available! Error!"
TypeError: exceptions must derive from BaseException
Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 469, in paperdict
    paperdict = paperdict_methods_dict[so

Bug when trying title 'uni-fying causal representation learning with the invariance principle, september 2024. url http://arxiv.org/abs/2409.02772' with 'semanticscholar'
Got None when trying title 'uni-fying causal representation learning with the invariance principle, september 2024. url http://arxiv.org/abs/2409.02772' with 'semanticscholar'
Trying title 'on the identification of temporally causal representation with instantaneous dependence' with 'arxiv'


 53%|██████████████████████████████████████████████████████████▍                                                    | 60/114 [01:50<01:36,  1.78s/it]

Found!
Trying title 'a twist for graph classification: optimizing causal information flow in graph neural networks' with 'arxiv'
Titles do not correspond :
a twist for graph classification: optimizing causal information flow in graph neural networks
When Graph Neural Network Meets Causality: Opportunities, Methodologies and An Outlook
Got None when trying title 'a twist for graph classification: optimizing causal information flow in graph neural networks' with 'arxiv'
Trying title 'a twist for graph classification: optimizing causal information flow in graph neural networks' with 'semanticscholar'


 54%|███████████████████████████████████████████████████████████▍                                                   | 61/114 [01:53<01:49,  2.06s/it]

Found!
Trying title 'weakly supervised disentangled generative causal representation learning' with 'arxiv'


 54%|████████████████████████████████████████████████████████████▎                                                  | 62/114 [01:53<01:24,  1.63s/it]

Found!
Trying title 'invariant causal representation learning for out-of-distribution generalization' with 'arxiv'
Titles do not correspond :
invariant causal representation learning for out-of-distribution generalization
Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs
Got None when trying title 'invariant causal representation learning for out-of-distribution generalization' with 'arxiv'
Trying title 'invariant causal representation learning for out-of-distribution generalization' with 'semanticscholar'


 55%|█████████████████████████████████████████████████████████████▎                                                 | 63/114 [01:56<01:38,  1.93s/it]

Found!
Trying title 'identifying weight-variant latent causal models' with 'arxiv'


 56%|██████████████████████████████████████████████████████████████▎                                                | 64/114 [01:58<01:30,  1.82s/it]

Found!
Trying title 'causal reasoning: charting a revolutionary course for next-generation ai-native wireless networks' with 'arxiv'


 57%|███████████████████████████████████████████████████████████████▎                                               | 65/114 [01:59<01:28,  1.80s/it]

Found!
Trying title 'causal disentanglement with network information for debiased recommendations' with 'arxiv'


 58%|████████████████████████████████████████████████████████████████▎                                              | 66/114 [02:00<01:09,  1.44s/it]

Found!
Trying title 'learning causal representation for training cross-domain pose estimator via generative interventions' with 'arxiv'
Titles do not correspond :
learning causal representation for training cross-domain pose estimator via generative interventions
Causal Deep Learning
Got None when trying title 'learning causal representation for training cross-domain pose estimator via generative interventions' with 'arxiv'
Trying title 'learning causal representation for training cross-domain pose estimator via generative interventions' with 'semanticscholar'


 59%|█████████████████████████████████████████████████████████████████▏                                             | 67/114 [02:03<01:30,  1.92s/it]

Found!
Trying title 'disentangled generative causal representation learning' with 'arxiv'
Titles do not correspond :
disentangled generative causal representation learning
Weakly Supervised Disentangled Generative Causal Representation Learning
Got None when trying title 'disentangled generative causal representation learning' with 'arxiv'
Trying title 'disentangled generative causal representation learning' with 'semanticscholar'


 60%|██████████████████████████████████████████████████████████████████▏                                            | 68/114 [02:07<01:50,  2.40s/it]

Found!
Trying title 'towards the reusability and compositionality of causal representations' with 'arxiv'


 61%|███████████████████████████████████████████████████████████████████▏                                           | 69/114 [02:07<01:23,  1.86s/it]

Found!
Trying title 'biscuit: causal representation learning from binary interactions' with 'arxiv'


 61%|████████████████████████████████████████████████████████████████████▏                                          | 70/114 [02:08<01:04,  1.47s/it]

Found!
Trying title 'look, learn and leverage (l3): mitigating visual-domain shift and discovering intrinsic relations via symbolic alignment' with 'arxiv'


 62%|█████████████████████████████████████████████████████████████████████▏                                         | 71/114 [02:10<01:11,  1.66s/it]

Found!
Trying title 'shadow datasets, new challenging datasets for causal representation learning' with 'arxiv'


 63%|██████████████████████████████████████████████████████████████████████                                         | 72/114 [02:11<00:57,  1.36s/it]

Found!
Trying title 'causal diffusion autoencoders: toward counterfactual generation via diffusion probabilistic models' with 'arxiv'


 64%|███████████████████████████████████████████████████████████████████████                                        | 73/114 [02:11<00:46,  1.12s/it]

Found!
Trying title 'relation-first modeling paradigm for causal representation learning toward the development of agi' with 'arxiv'


 65%|████████████████████████████████████████████████████████████████████████                                       | 74/114 [02:13<00:51,  1.29s/it]

Found!
Trying title 'realization of causal representation learning to adjust confounding bias in latent space' with 'arxiv'


 66%|█████████████████████████████████████████████████████████████████████████                                      | 75/114 [02:14<00:44,  1.15s/it]

Found!
Trying title 'realization of causal representation learning and redefined dag for causal ai' with 'arxiv'
Titles do not correspond :
realization of causal representation learning and redefined dag for causal ai
Realization of Causal Representation Learning to Adjust Confounding Bias in Latent Space
Got None when trying title 'realization of causal representation learning and redefined dag for causal ai' with 'arxiv'
Trying title 'realization of causal representation learning and redefined dag for causal ai' with 'semanticscholar'


 67%|██████████████████████████████████████████████████████████████████████████                                     | 76/114 [02:26<02:53,  4.56s/it]

Found!
Trying title 'causal representation learning in temporal data via single-parent decoding' with 'arxiv'


 68%|██████████████████████████████████████████████████████████████████████████▉                                    | 77/114 [02:28<02:23,  3.89s/it]

Found!
Trying title 'causal representation learning through higher-level information extraction' with 'arxiv'
Titles do not correspond :
causal representation learning through higher-level information extraction
Causal Representation Learning Made Identifiable by Grouping of Observational Variables
Got None when trying title 'causal representation learning through higher-level information extraction' with 'arxiv'
Trying title 'causal representation learning through higher-level information extraction' with 'semanticscholar'


 68%|███████████████████████████████████████████████████████████████████████████▉                                   | 78/114 [02:33<02:23,  3.98s/it]

Found!
Trying title 'sparsity regularization via tree-structured environments for disentangled representations' with 'arxiv'


 69%|████████████████████████████████████████████████████████████████████████████▉                                  | 79/114 [02:35<02:04,  3.55s/it]

Found!
Trying title 'diffusion based causal representation learning' with 'arxiv'


 70%|█████████████████████████████████████████████████████████████████████████████▉                                 | 80/114 [02:37<01:42,  3.03s/it]

Found!
Trying title 'c-disentanglement: discovering causally-independent generative factors under an inductive bias of confounder' with 'arxiv'


 71%|██████████████████████████████████████████████████████████████████████████████▊                                | 81/114 [02:39<01:32,  2.81s/it]

Found!
Trying title 'scm-vae: learning identifiable causal representations via structural knowledge' with 'arxiv'
Titles do not correspond :
scm-vae: learning identifiable causal representations via structural knowledge
CausalVAE: Structured Causal Disentanglement in Variational Autoencoder
Got None when trying title 'scm-vae: learning identifiable causal representations via structural knowledge' with 'arxiv'
Trying title 'scm-vae: learning identifiable causal representations via structural knowledge' with 'semanticscholar'


 72%|███████████████████████████████████████████████████████████████████████████████▊                               | 82/114 [02:42<01:26,  2.70s/it]

Found!
Trying title 'model-free causal reinforcement learning with causal diagrams' with 'arxiv'
Titles do not correspond :
model-free causal reinforcement learning with causal diagrams
Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy
Got None when trying title 'model-free causal reinforcement learning with causal diagrams' with 'arxiv'
Trying title 'model-free causal reinforcement learning with causal diagrams' with 'semanticscholar'


 73%|████████████████████████████████████████████████████████████████████████████████▊                              | 83/114 [02:45<01:26,  2.79s/it]

Found!
Trying title 'nonlinear invariant risk minimization: a causal approach' with 'arxiv'


 74%|█████████████████████████████████████████████████████████████████████████████████▊                             | 84/114 [02:45<01:03,  2.13s/it]

Found!
Trying title 'causal representation learning via counterfactual intervention' with 'arxiv'
Titles do not correspond :
causal representation learning via counterfactual intervention
Interventional Causal Representation Learning
Got None when trying title 'causal representation learning via counterfactual intervention' with 'arxiv'
Trying title 'causal representation learning via counterfactual intervention' with 'semanticscholar'


 75%|██████████████████████████████████████████████████████████████████████████████████▊                            | 85/114 [02:49<01:12,  2.49s/it]

Found!
Trying title 'causal factor disentanglement for few-shot domain adaptation in video prediction' with 'arxiv'
Titles do not correspond :
causal factor disentanglement for few-shot domain adaptation in video prediction
Disentangled Representation Learning
Got None when trying title 'causal factor disentanglement for few-shot domain adaptation in video prediction' with 'arxiv'
Trying title 'causal factor disentanglement for few-shot domain adaptation in video prediction' with 'semanticscholar'


 75%|███████████████████████████████████████████████████████████████████████████████████▋                           | 86/114 [02:51<01:12,  2.59s/it]

Found!
Trying title 'temporally disentangled representation learning under unknown nonstationarity' with 'arxiv'


 76%|████████████████████████████████████████████████████████████████████████████████████▋                          | 87/114 [02:52<00:53,  1.99s/it]

Found!
Trying title 'possible principles for aligned structure learning agents' with 'arxiv'


 77%|█████████████████████████████████████████████████████████████████████████████████████▋                         | 88/114 [02:53<00:41,  1.59s/it]

Found!
Trying title 'towards generalizable reinforcement learning via causality-guided self-adaptive representations' with 'arxiv'


 78%|██████████████████████████████████████████████████████████████████████████████████████▋                        | 89/114 [02:55<00:42,  1.69s/it]

Found!
Trying title 'unbiased semantic representation learning based on causal disentanglement for domain generalization' with 'arxiv'
Titles do not correspond :
unbiased semantic representation learning based on causal disentanglement for domain generalization
Disentangled Representation Learning
Got None when trying title 'unbiased semantic representation learning based on causal disentanglement for domain generalization' with 'arxiv'
Trying title 'unbiased semantic representation learning based on causal disentanglement for domain generalization' with 'semanticscholar'


 79%|███████████████████████████████████████████████████████████████████████████████████████▋                       | 90/114 [02:58<00:50,  2.10s/it]

Found!
Trying title 'causal disentanglement domain generalization for time-series signal fault diagnosis' with 'arxiv'
Titles do not correspond :
causal disentanglement domain generalization for time-series signal fault diagnosis
Causal Disentanglement Hidden Markov Model for Fault Diagnosis
Got None when trying title 'causal disentanglement domain generalization for time-series signal fault diagnosis' with 'arxiv'
Trying title 'causal disentanglement domain generalization for time-series signal fault diagnosis' with 'semanticscholar'


 80%|████████████████████████████████████████████████████████████████████████████████████████▌                      | 91/114 [03:01<00:54,  2.38s/it]

Found!
Trying title 'controllable image generation based on causal representation learning' with 'arxiv'
Titles do not correspond :
controllable image generation based on causal representation learning
Image Content Generation with Causal Reasoning
Got None when trying title 'controllable image generation based on causal representation learning' with 'arxiv'
Trying title 'controllable image generation based on causal representation learning' with 'semanticscholar'


 81%|█████████████████████████████████████████████████████████████████████████████████████████▌                     | 92/114 [03:05<01:03,  2.87s/it]

Found!
Trying title 'de-age confounder based causal representation learning for cuffless blood pressure estimation' with 'arxiv'


Traceback (most recent call last):
  File "/Users/oscar.clivio/cdllm_venv/lib/python3.12/site-packages/arxiv/__init__.py", line 626, in _parse_feed
    return self.__try_parse_feed(url, first_page=first_page, try_index=_try_index)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/cdllm_venv/lib/python3.12/site-packages/arxiv/__init__.py", line 663, in __try_parse_feed
    raise HTTPError(url, try_index, resp.status_code)
arxiv.HTTPError: Page request resulted in HTTP 400 (https://export.arxiv.org/api/query?search_query=&id_list=2024-08%3Fskip%3D2625%26show%3D2000&sortBy=submittedDate&sortOrder=descending&start=0&max_results=100)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/oscar.clivio/cdllm_venv/lib/python3.12/site-packages/arxiv/__init__.py", line 626, in _parse_feed
    return self.__try_parse_feed(url, first_page=first_page, try_index=_try_index)
 

Google search crashed
Got None when trying title 'de-age confounder based causal representation learning for cuffless blood pressure estimation' with 'arxiv'
Trying title 'de-age confounder based causal representation learning for cuffless blood pressure estimation' with 'semanticscholar'


 82%|██████████████████████████████████████████████████████████████████████████████████████████▌                    | 93/114 [03:18<02:04,  5.91s/it]

Found!
Trying title 'learning causal representations from general environments: identifiability and intrinsic ambiguity' with 'arxiv'


 82%|███████████████████████████████████████████████████████████████████████████████████████████▌                   | 94/114 [03:19<01:31,  4.56s/it]

Found!
Trying title 'interpretability is in the mind of the beholder: a causal framework for human-interpretable representation learning' with 'arxiv'


 83%|████████████████████████████████████████████████████████████████████████████████████████████▌                  | 95/114 [03:21<01:12,  3.79s/it]

Found!
Trying title 'measuring the effect of causal disentanglement on the adversarial robustness of neural network models' with 'arxiv'


 84%|█████████████████████████████████████████████████████████████████████████████████████████████▍                 | 96/114 [03:22<00:51,  2.86s/it]

Found!
Trying title 'learning causality-inspired representation consistency for video anomaly detection' with 'arxiv'


 85%|██████████████████████████████████████████████████████████████████████████████████████████████▍                | 97/114 [03:24<00:45,  2.68s/it]

Found!
Trying title 'leveraging task structures for improved identifiability in neural network representations' with 'arxiv'


 86%|███████████████████████████████████████████████████████████████████████████████████████████████▍               | 98/114 [03:25<00:35,  2.23s/it]

Found!
Trying title 'unbiased scene graph generation via two-stage causal modeling' with 'arxiv'


 87%|████████████████████████████████████████████████████████████████████████████████████████████████▍              | 99/114 [03:28<00:33,  2.25s/it]

Found!
Trying title 'towards causal representation learning and deconfounding from indefinite data' with 'arxiv'


 88%|████████████████████████████████████████████████████████████████████████████████████████████████▍             | 100/114 [03:28<00:25,  1.81s/it]

Found!
Trying title 'fault diagnosis for high‐speed train braking system based on disentangled causal representation learning' with 'arxiv'
Titles do not correspond :
fault diagnosis for high‐speed train braking system based on disentangled causal representation learning
Causal Machine Learning: A Survey and Open Problems
Got None when trying title 'fault diagnosis for high‐speed train braking system based on disentangled causal representation learning' with 'arxiv'
Trying title 'fault diagnosis for high‐speed train braking system based on disentangled causal representation learning' with 'semanticscholar'


 89%|█████████████████████████████████████████████████████████████████████████████████████████████████▍            | 101/114 [03:33<00:33,  2.55s/it]

Found!
Trying title 'a survey on causal representation learning and future work for medical image analysis' with 'arxiv'


 89%|██████████████████████████████████████████████████████████████████████████████████████████████████▍           | 102/114 [03:34<00:26,  2.18s/it]

Found!
Trying title 'language-based causal representation learning' with 'arxiv'


 90%|███████████████████████████████████████████████████████████████████████████████████████████████████▍          | 103/114 [03:37<00:26,  2.41s/it]

Found!
Trying title 'improving multi-task generalization via regularizing spurious correlation' with 'arxiv'


 91%|████████████████████████████████████████████████████████████████████████████████████████████████████▎         | 104/114 [03:41<00:28,  2.85s/it]

Found!
Trying title 'causal representation learning for out-of-distribution recommendation' with 'arxiv'
Titles do not correspond :
causal representation learning for out-of-distribution recommendation
Graph Representation Learning via Causal Diffusion for Out-of-Distribution Recommendation
Got None when trying title 'causal representation learning for out-of-distribution recommendation' with 'arxiv'
Trying title 'causal representation learning for out-of-distribution recommendation' with 'semanticscholar'


 92%|█████████████████████████████████████████████████████████████████████████████████████████████████████▎        | 105/114 [03:55<00:56,  6.29s/it]

Found!
Trying title 'translational lung imaging analysis through disentangled representations' with 'arxiv'


 93%|██████████████████████████████████████████████████████████████████████████████████████████████████████▎       | 106/114 [03:56<00:37,  4.68s/it]

Found!
Trying title 'on causally disentangled representations' with 'arxiv'


 94%|███████████████████████████████████████████████████████████████████████████████████████████████████████▏      | 107/114 [03:57<00:24,  3.52s/it]

Found!
Trying title '3didentbox: a toolbox for identifiability benchmarking' with 'arxiv'
Titles do not correspond :
3didentbox: a toolbox for identifiability benchmarking
Natural Counterfactuals With Necessary Backtracking
Got None when trying title '3didentbox: a toolbox for identifiability benchmarking' with 'arxiv'
Trying title '3didentbox: a toolbox for identifiability benchmarking' with 'semanticscholar'


 95%|████████████████████████████████████████████████████████████████████████████████████████████████████████▏     | 108/114 [04:00<00:20,  3.36s/it]

Found!
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'arxiv'
Titles do not correspond :
advances in causal representation learning: discovery of the hidden world
Discovery of the Hidden World with Large Language Models
Got None when trying title 'advances in causal representation learning: discovery of the hidden world' with 'arxiv'
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'semanticscholar'


 96%|█████████████████████████████████████████████████████████████████████████████████████████████████████████▏    | 109/114 [04:03<00:16,  3.39s/it]

Found!
Trying title 'an adversarial perspective on accuracy, robustness, fairness, and privacy: multilateral-tradeoffs in trustworthy ml' with 'arxiv'
Titles do not correspond :
an adversarial perspective on accuracy, robustness, fairness, and privacy: multilateral-tradeoffs in trustworthy ml
SoK: Unintended Interactions among Machine Learning Defenses and Risks
Got None when trying title 'an adversarial perspective on accuracy, robustness, fairness, and privacy: multilateral-tradeoffs in trustworthy ml' with 'arxiv'
Trying title 'an adversarial perspective on accuracy, robustness, fairness, and privacy: multilateral-tradeoffs in trustworthy ml' with 'semanticscholar'


 96%|██████████████████████████████████████████████████████████████████████████████████████████████████████████▏   | 110/114 [04:06<00:13,  3.27s/it]

Found!
Trying title 'causalvae: structured causal disentanglement in variational autoencoder' with 'arxiv'


 97%|███████████████████████████████████████████████████████████████████████████████████████████████████████████   | 111/114 [04:07<00:07,  2.59s/it]

Found!
Trying title 'score-based causal representation learning: linear and general transformations' with 'arxiv'
Error in Google Search without proxy - trying with proxy
Google search crashed
Got None when trying title 'score-based causal representation learning: linear and general transformations' with 'arxiv'
Trying title 'score-based causal representation learning: linear and general transformations' with 'semanticscholar'


Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 357, in search_google
    return next(url for url in google_search_module(query))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
StopIteration

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 379, in search_arxiv
    id_list=[self.search_google(title + ' site:arxiv.org').split('/')[-1]],
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 364, in search_google
    raise "No proxy available! Error!"
TypeError: exceptions must derive from BaseException
 98%|████████████████████████████████████████████████████████████████████████████████████████████████████████████  | 112/114 [04:09<00:04,  2.35s/it]

Found!
Trying title 'causal disentanglement' with 'arxiv'
Titles do not correspond :
causal disentanglement
Learning Causally Disentangled Representations via the Principle of Independent Causal Mechanisms
Got None when trying title 'causal disentanglement' with 'arxiv'
Trying title 'causal disentanglement' with 'semanticscholar'


 99%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████ | 113/114 [04:13<00:02,  2.77s/it]

Titles do not correspond :
causal disentanglement
Identifiability Guarantees for Causal Disentanglement from Soft Interventions
Got None when trying title 'causal disentanglement' with 'semanticscholar'
Trying title 'celcomen: spatial causal disentanglement for single-cell and tissue perturbation modeling' with 'arxiv'


100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████| 114/114 [04:15<00:00,  2.24s/it]


Found!


  0%|                                                                                                                          | 0/6 [00:00<?, ?it/s]

 
title requested  :  linear causal representation learning from unknown multi-node interventions
title found  :  Linear Causal Representation Learning from Unknown Multi-node Interventions
abstract  :  Despite the multifaceted recent advances in interventional causal representation learning (CRL), they primarily focus on the stylized assumption of single-node interventions. This assumption is not valid in a wide range of applications, and generally, the subset of nodes intervened in an interventional environment is fully unknown. This paper focuses on interventional CRL under unknown multi-node (UMN) interventional environments and establishes the first identifiability results for general latent causal models (parametric or nonparametric) under stochastic interventions (soft or hard) and linear transformation from the latent to observed space. Specifically, it is established that given sufficiently diverse interventional environments, (i) identifiability up to ancestors is possible us

Add this title?  (y/n):  y


 17%|██████████████████▊                                                                                              | 1/6 [04:18<21:31, 258.28s/it]

 
title requested  :  general identifiability and achievability for causal representation learning
title found  :  General Identifiability and Achievability for Causal Representation Learning
abstract  :  This paper focuses on causal representation learning (CRL) under a general nonparametric latent causal model and a general transformation model that maps the latent data to the observational data. It establishes identifiability and achievability results using two hard uncoupled interventions per node in the latent causal graph. Notably, one does not know which pair of intervention environments have the same node intervened (hence, uncoupled). For identifiability, the paper establishes that perfect recovery of the latent causal model and variables is guaranteed under uncoupled interventions. For achievability, an algorithm is designed that uses observational and interventional data and recovers the latent causal model and variables with provable guarantees. This algorithm leverages sco

Add this title?  (y/n):  y


 33%|█████████████████████████████████████▋                                                                           | 2/6 [04:19<07:07, 106.91s/it]

 
title requested  :  score-based causal representation learning from interventions: nonparametric identifiability
title found  :  Score-based Causal Representation Learning from Interventions: Nonparametric Identifiability
abstract  :  This paper focuses on causal representation learning (CRL) under a general non-parametric causal latent model and a general transformation model that maps the latent data to the observational data. It establishes identifiability and achievability results using two (stochastic) hard uncoupled interventions per node in the latent causal graph. Notably, one does not know which pair of intervention environments have the same node intervened (hence, uncoupled environments). For identifiability, the paper establishes that perfect recovery of the latent causal model and variables is guaranteed under uncoupled interventions. For achievability, an algorithm is designed that uses observational and interventional data and recovers the latent causal model and varia

Add this title?  (y/n):  y


 50%|█████████████████████████████████████████████████████████                                                         | 3/6 [04:19<02:54, 58.27s/it]

 
title requested  :  learning causal representations from general environments: identifiability and intrinsic ambiguity
title found  :  Learning Causal Representations from General Environments: Identifiability and Intrinsic Ambiguity
abstract  :  We study causal representation learning, the task of recovering high-level latent variables and their causal relationships in the form of a causal graph from low-level observed data (such as text and images), assuming access to observations generated from multiple environments. Prior results on the identifiability of causal representations typically assume access to single-node interventions which is rather unrealistic in practice, since the latent variables are unknown in the first place. In this work, we provide the first identifiability results based on data that stem from general environments. We show that for linear causal models, while the causal graph can be fully recovered, the latent variables are only identified up to the surrounde

Add this title?  (y/n):  y


 67%|████████████████████████████████████████████████████████████████████████████                                      | 4/6 [04:20<01:10, 35.42s/it]

 
title requested  :  advances in causal representation learning: discovery of the hidden world
title found  :  Advances in Causal Representation Learning: Discovery of the Hidden World
abstract  :  Can we find the causal direction between two random variables without temporal precedence information? How can we figure out where latent causal variables should be and how they are related? In our daily life and science, people often attempt to answer such causal questions for the purpose of understanding, proper manipulation of systems, and robust prediction under interventions. Accordingly, finding causality and making use of it is an essential problem in scientific discovery and engineering. Traditional causal discovery approaches [1], such as the PC algorithm and GES, mainly focus on finding causal relations among measured variables, even in the presence of latent confounders (see, e.g., the FCI algorithm). However, in a wide range of real problems, we even do not know what the causal 

Add this title?  (y/n):  y


 83%|███████████████████████████████████████████████████████████████████████████████████████████████                   | 5/6 [04:20<00:22, 22.77s/it]

 
title requested  :  score-based causal representation learning: linear and general transformations
title found  :  Score-based Causal Representation Learning: Linear and General Transformations
abstract  :  This paper addresses intervention-based causal representation learning (CRL) under a general nonparametric latent causal model and an unknown transformation that maps the latent variables to the observed variables. Linear and general transformations are investigated. The paper addresses both the identifiability and achievability aspects. Identifiability refers to determining algorithm-agnostic conditions that ensure recovering the true latent causal variables and the latent causal graph underlying them. Achievability refers to the algorithmic aspects and addresses designing algorithms that achieve identifiability guarantees. By drawing novel connections between score functions (i.e., the gradients of the logarithm of density functions) and CRL, this paper designs a score-based cla

Add this title?  (y/n):  y


100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [04:20<00:00, 43.47s/it]


In [15]:
for idx, title in enumerate(crl_titles_filtered, start=1):
    print(idx, ' : ', title)

1  :  linear causal representation learning from unknown multi-node interventions
2  :  general identifiability and achievability for causal representation learning
3  :  score-based causal representation learning from interventions: nonparametric identifiability
4  :  learning causal representations from general environments: identifiability and intrinsic ambiguity
5  :  advances in causal representation learning: discovery of the hidden world
6  :  score-based causal representation learning: linear and general transformations


We download the papers to the home folder. Note that we could download as many papers as we want, including all the CRL papers found previously. Each title is searched on arXiv first, then on Google Scholar if arXiv fails, then on Semantic Scholar (the sources and their order can be set through the "sources" argument of the method; see the code). Note that Google Scholar typically requires a proxy to work, as it is very quick to raise a rate limit error.
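To make the fallback behaviour concrete, here is a minimal sketch of trying several sources in order and rejecting a hit whose title does not match the request. It is not the code of `litrevtools`: `normalize_title`, `find_paper`, and the two stand-in lookups are hypothetical names introduced for illustration, and the real sources would query arXiv, Google Scholar, or Semantic Scholar instead of returning fixed dictionaries.

```python
import re


def normalize_title(title):
    """Lowercase a title and collapse punctuation and whitespace (hypothetical helper)."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def find_paper(title, sources):
    """Try each (name, lookup) pair in order and return the first hit whose title matches.

    `sources` is a list of (source_name, lookup_function) pairs; each lookup
    returns a dict with a "title" key, or None when nothing is found.
    """
    for name, lookup in sources:
        found = lookup(title)
        if found is None:
            continue
        if normalize_title(found["title"]) != normalize_title(title):
            # Comparable to the "Titles do not correspond" case: skip to the next source.
            continue
        return name, found
    return None


# Stand-in lookups (assumptions): real ones would call the corresponding service.
def fake_arxiv(title):
    return {"title": "Score-based Causal Representation Learning with Interventions"}


def fake_scholar(title):
    return {"title": "Score-based causal representation learning from interventions: "
                     "Nonparametric identifiability"}


requested = ("score-based causal representation learning from interventions: "
             "nonparametric identifiability")
result = find_paper(requested, [("arxiv", fake_arxiv), ("googlescholar", fake_scholar)])
print(result[0])  # googlescholar
```

Comparing normalized titles rather than raw strings lets a lowercase request match a properly capitalized result, while still rejecting a different paper with a similar name.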

In [16]:
lt.download(crl_titles_filtered, '~')

  0%|                                                                                                                          | 0/6 [00:00<?, ?it/s]

Trying title 'linear causal representation learning from unknown multi-node interventions' with 'arxiv'


 17%|███████████████████                                                                                               | 1/6 [00:01<00:09,  1.99s/it]

Found!
Trying title 'general identifiability and achievability for causal representation learning' with 'arxiv'


 33%|██████████████████████████████████████                                                                            | 2/6 [00:04<00:09,  2.31s/it]

Found!
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'arxiv'
Titles do not correspond :
score-based causal representation learning from interventions: nonparametric identifiability
Score-based Causal Representation Learning with Interventions
Got None when trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'arxiv'
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'googlescholar'


 50%|█████████████████████████████████████████████████████████                                                         | 3/6 [00:12<00:14,  4.90s/it]

Found!
Trying title 'learning causal representations from general environments: identifiability and intrinsic ambiguity' with 'arxiv'


 67%|████████████████████████████████████████████████████████████████████████████                                      | 4/6 [00:13<00:06,  3.19s/it]

Found!
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'arxiv'
Titles do not correspond :
advances in causal representation learning: discovery of the hidden world
Discovery of the Hidden World with Large Language Models
Got None when trying title 'advances in causal representation learning: discovery of the hidden world' with 'arxiv'
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'googlescholar'


 83%|███████████████████████████████████████████████████████████████████████████████████████████████                   | 5/6 [00:21<00:04,  4.94s/it]

Found!
Trying title 'score-based causal representation learning: linear and general transformations' with 'arxiv'


100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [00:23<00:00,  3.89s/it]

Found!





Downloaded file varici2023sbcrlfini.pdf from https://openreview.net/pdf?id=MytNJ6lXAV
Downloaded file varıcı2024lcrlfumni.pdf from http://arxiv.org/pdf/2406.05937v1
Downloaded file varıcı2024giaafcrl.pdf from http://arxiv.org/pdf/2310.15450v2
Downloaded file jin2024lcrfgeiaia.pdf from http://arxiv.org/pdf/2311.12267v2
Downloaded file varıcı2024sbcrllagt.pdf from http://arxiv.org/pdf/2402.00849v3
Downloaded file zhangNAaicrldothw.pdf from http://cogsys.org/symposium/discovery-2023/abstracts/Abstract_5108.pdf


[{'venue': '… Learning Workshop at …',
  'title': 'Score-based causal representation learning from interventions: Nonparametric identifiability',
  'year': '2023',
  'booktitle': 'Causal Representation Learning Workshop at NeurIPS 2023',
  'author': 'Varici, Burak and Acart{\\"u}rk, Emre and Shanmugam, Karthikeyan and Tajer, Ali',
  'abstract': 'This paper focuses on causal representation learning (CRL) under a general nonparametric causal latent model and a general transformation model that maps the latent data to the observational data. It establishes** identifiability** and** achievability** results using two (stochastic) hard** uncoupled** interventions per node in the latent causal graph. Notably, one does not know which pair of intervention environments have the same node intervened (hence, uncoupled environments). For identifiability, the paper establishes that perfect',
  'ENTRYTYPE': 'inproceedings',
  'ID': 'varici2023sbcrlfini',
  'url': 'https://openreview.net/pdf?id=MytNJ6

Finally, we print their BibTeX entries. These were in fact already retrieved and used by the download method above: each paper is downloaded from the "url" field of its BibTeX entry.
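The entry IDs and file names in the output above (for example `varici2023sbcrlfini`) appear to follow the pattern first-author surname, year, then the initial of each title word, with hyphenated words split. The sketch below reproduces that pattern; it is inferred from the printed output only, not taken from the library's source, and `citation_key` is a hypothetical name.

```python
import re


def citation_key(first_author_surname, year, title):
    """Build a key such as 'varici2023sbcrlfini' (pattern inferred from the output above)."""
    # Split on whitespace and hyphens so that "score-based" contributes "s" and "b".
    words = re.split(r"[\s\-]+", title.lower())
    initials = "".join(w[0] for w in words if w and w[0].isalnum())
    return f"{first_author_surname.lower()}{year}{initials}"


entry = {
    "ID": citation_key(
        "Varici",
        "2023",
        "Score-based causal representation learning from interventions: "
        "Nonparametric identifiability",
    ),
    # URL copied from the download log above.
    "url": "https://openreview.net/pdf?id=MytNJ6lXAV",
}

# The PDF would be fetched from entry["url"] and saved under this name.
filename = entry["ID"] + ".pdf"
print(filename)  # varici2023sbcrlfini.pdf
```

When no year is available, the output above suggests that the string `NA` is used in its place, as in `zhangNAaicrldothw`.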

In [17]:
bibtexs = lt.bibtexs(crl_titles_filtered)

  0%|                                                                                                                          | 0/6 [00:00<?, ?it/s]

Trying title 'linear causal representation learning from unknown multi-node interventions' with 'arxiv'


 17%|███████████████████                                                                                               | 1/6 [00:01<00:07,  1.55s/it]

Found!
Trying title 'general identifiability and achievability for causal representation learning' with 'arxiv'


 33%|██████████████████████████████████████                                                                            | 2/6 [00:03<00:07,  1.88s/it]

Found!
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'arxiv'
Titles do not correspond :
score-based causal representation learning from interventions: nonparametric identifiability
Score-based Causal Representation Learning with Interventions
Got None when trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'arxiv'
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'own'
Bug when trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'own'
Got None when trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'own'
Trying title 'score-based causal representation learning from interventions: nonparametric identifiability' with 'googlescholar'


Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 469, in paperdict
    paperdict = paperdict_methods_dict[source](title)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 436, in _paperdict_own
    if os.path.exists(self.folder):
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen genericpath>", line 19, in exists
TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType
 50%|█████████████████████████████████████████████████████████                                                         | 3/6 [00:10<00:12,  4.02s/it]

Found!
Trying title 'learning causal representations from general environments: identifiability and intrinsic ambiguity' with 'arxiv'


 67%|████████████████████████████████████████████████████████████████████████████                                      | 4/6 [00:10<00:05,  2.64s/it]

Found!
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'arxiv'
Titles do not correspond :
advances in causal representation learning: discovery of the hidden world
Discovery of the Hidden World with Large Language Models
Got None when trying title 'advances in causal representation learning: discovery of the hidden world' with 'arxiv'
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'own'
Bug when trying title 'advances in causal representation learning: discovery of the hidden world' with 'own'
Got None when trying title 'advances in causal representation learning: discovery of the hidden world' with 'own'
Trying title 'advances in causal representation learning: discovery of the hidden world' with 'googlescholar'


Traceback (most recent call last):
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 469, in paperdict
    paperdict = paperdict_methods_dict[source](title)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/oscar.clivio/causal_discovery_llm/researchtools/litrevtools.py", line 436, in _paperdict_own
    if os.path.exists(self.folder):
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen genericpath>", line 19, in exists
TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType
 83%|███████████████████████████████████████████████████████████████████████████████████████████████                   | 5/6 [00:17<00:04,  4.18s/it]

Found!
Trying title 'score-based causal representation learning: linear and general transformations' with 'arxiv'


100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 6/6 [00:19<00:00,  3.27s/it]

Found!
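The two tracebacks above come from the 'own' source, which calls `os.path.exists(self.folder)` while `self.folder` is `None`. The lookup still succeeds because the next source is tried, but the error can be avoided with a guard. A minimal sketch, assuming the folder is an optional string attribute; `folder_is_usable` is a hypothetical helper, not part of `litrevtools`:

```python
import os
from typing import Optional


def folder_is_usable(folder: Optional[str]) -> bool:
    """Return True only if a folder was configured and exists on disk."""
    if folder is None:
        # No local folder configured: the 'own' source has nothing to search.
        return False
    # Expand "~" so that paths such as '~' or '~/papers' resolve correctly.
    return os.path.isdir(os.path.expanduser(folder))


print(folder_is_usable(None))  # False
```

With such a check, the 'own' source could return `None` directly when no folder is set, instead of raising a `TypeError` that is caught and logged.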





In [18]:
print(bibtexs)

@misc{jin2024lcrfgeiaia,
 abstract = {We study causal representation learning, the task of recovering high-level latent variables and their causal relationships in the form of a causal graph from low-level observed data (such as text and images), assuming access to observations generated from multiple environments. Prior results on the identifiability of causal representations typically assume access to single-node interventions which is rather unrealistic in practice, since the latent variables are unknown in the first place. In this work, we provide the first identifiability results based on data that stem from general environments. We show that for linear causal models, while the causal graph can be fully recovered, the latent variables are only identified up to the surrounded-node ambiguity (SNA) \citep{varici2023score}. We provide a counterpart of our guarantee, showing that SNA is basically unavoidable in our setting. We also propose an algorithm, \texttt{LiNGCReL} which provably