KPrototypes fit_predict error: "could not convert string to float" #47

kroscek · 2017-07-11T12:19:03Z

I want to apply Kprototype into my dataset but it seems that the code can't convert into numpy arrays?
km = kprototypes.KPrototypes(n_clusters=10, init='Cao', verbose=2)
train=pd.read_csv('/home/lemma/train.csv')
train['clusters_KModes'] = km.fit_predict(train1,categorical=[1])
ValueError: could not convert string to float: MJ

Trying to convert into object to match the example given also not successful:
km = kprototypes.KPrototypes(n_clusters=10, init='Cao', verbose=2)
train=pd.read_csv('/home/lemma/train.csv')
train1=train1.values.astype(object)
train['clusters_KModes'] = km.fit_predict(train1,categorical=[1])
ValueError: could not convert string to float: MJ

nicodv · 2017-07-11T17:30:33Z

Could you post the full traceback, so I can see where the error occurs exactly?

Also, what does your data look like?

kroscek · 2017-07-15T14:20:32Z

My data contains both numerical (0.xxx)and categorical of one and two alphabet (A,B,XZ). Here is the snippet:

km = kprototypes.KPrototypes(n_clusters=10, init='Cao', verbose=2)
train1=pd.read_csv('/home/lemma/train.csv')
train1=train1.drop(['id','loss'],1)
train1=train1.values.astype(object)
train1['clusters_KModes'] = km.fit_predict(train1,categorical=[1,2,3])

nicodv · 2017-07-18T22:23:26Z

With categorical=[1,2,3], you're telling the algorithm that the second, third and fourth columns are categorical. But from your screenshot it looks like there's many more categorical variables. That why kmodes is trying to interpret the 'MJ' string as a float.

kroscek · 2017-07-19T02:49:03Z

Hi categorical=[1,2,3] is just for snipped only, The actual categorical written in python is ranging from 1-116. Thus I need to do treatment for categorical data because two alphabetic categorical cannot be converted into float?

nicodv added the bug label Jul 11, 2017

nicodv changed the title ~~[HELP] Kprototype fit_predict error: could not convert string to float?~~ KPrototypes fit_predict error: "could not convert string to float" Jul 11, 2017

nicodv closed this as completed Jul 18, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KPrototypes fit_predict error: "could not convert string to float" #47

KPrototypes fit_predict error: "could not convert string to float" #47

kroscek commented Jul 11, 2017 •

edited

Loading

nicodv commented Jul 11, 2017

kroscek commented Jul 15, 2017 •

edited

Loading

nicodv commented Jul 18, 2017

kroscek commented Jul 19, 2017 •

edited

Loading

KPrototypes fit_predict error: "could not convert string to float" #47

KPrototypes fit_predict error: "could not convert string to float" #47

Comments

kroscek commented Jul 11, 2017 • edited Loading

nicodv commented Jul 11, 2017

kroscek commented Jul 15, 2017 • edited Loading

nicodv commented Jul 18, 2017

kroscek commented Jul 19, 2017 • edited Loading

kroscek commented Jul 11, 2017 •

edited

Loading

kroscek commented Jul 15, 2017 •

edited

Loading

kroscek commented Jul 19, 2017 •

edited

Loading