PC algo only working with int data inputs #69

robertness · 2022-12-09T20:15:49Z

Right now, the PC algorithm I believe requires discrete variables to be integers instead of characters.
I tried running PC on this data:

A	S	T	L	B	E	X	D
no	yes	no	no	yes	no	no	yes
no	yes	no	no	no	no	no	no
no	no	yes	no	no	yes	yes	yes
no	no	no	no	yes	no	no	yes
no	no	no	no	no	no	no	yes

But it threw an error. To get it to work I had to convert the values to ints.

def convert_to_int(df):
    for var in df.columns:
        data[var] = [1 if x == "yes" else 0 for x in data[var]]
    return df
data_mod = convert_to_int(data)

pc.fit(data_mod, context)

Calling this a bug. pc.fit(data, context) should work.

The text was updated successfully, but these errors were encountered:

adam2392 · 2022-12-09T22:31:54Z

Could the user just call an Encoder preprocessing function from scikit-learn? Or should we add that step for them? Either way good catch, we should document this accordingly for any categorical/discrete tests.

robertness mentioned this issue Dec 9, 2022

PC tutorial using ASIA data #67

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PC algo only working with int data inputs #69

PC algo only working with int data inputs #69

robertness commented Dec 9, 2022 •

edited

Loading

adam2392 commented Dec 9, 2022

PC algo only working with int data inputs #69

PC algo only working with int data inputs #69

Comments

robertness commented Dec 9, 2022 • edited Loading

adam2392 commented Dec 9, 2022

robertness commented Dec 9, 2022 •

edited

Loading