How to handle named variables

Hi,

I have been working with the causal-learn package for a couple of weeks now and I was wondering if there is a plan to add support for named variables? 

As I understand it, at the moment data is generally assumed to take the form of a 2d np.array where each column contains a value for a particular variable `X1, X2, ..., XN`. I can quite easily take a csv file and convert it to this format but, in doing so, I lose the names of my columns. This relies on me having to keep track of which variable (`X1, X2, ..., XN`) corresponds to which named column of my csv data, which makes tasks such as visualising the output quite difficult.

I have been using the `to_pyplot` method to draw the discovered DAG and passing it the labels. However, this relies on the order being preserved (simply replacing X1 with the first label, X2 with the second, and so on). This has tripped me a couple of times when comparing two graphs produced from similar data where the variables do not align.

I am not sure what the solution is here, but it would be really useful if it were possible to attach actual variable names to the graph rather than doing this manually after computation. One possibility would be to support pandas dataframes for reading in data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to handle named variables #38

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

How to handle named variables #38

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions