dict target #34

Closed
ilyes495 opened this issue Aug 1, 2017 · 2 comments

Comments

@ilyes495

ilyes495 commented Aug 1, 2017

I have a question about the format of the target dataframe that is passed to fit_predict(): does it contain the labels of the training dataset only, or of the test dataset as well? And how should it be formatted?
I think it's not clear whether users have to process the data themselves before passing it to the predictor.

Thanks

@AxeldeRomblay
Owner

The "target dataframe" (actually it must be a pandas Series) must contain the labels of the training dataset only.

  • The index of the target must match the index of the train dataset.
  • The Series' name must correspond to your target name.
  • dtype = 'int' for classification (as you're supposed to have encoded it beforehand) or dtype = 'float' for regression.
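
The three points above can be sketched roughly as follows. This is only an illustration: the column names, index values, and the `target_name` variable are made up for the example and do not come from the library.

```python
import pandas as pd

# Hypothetical training data; column names and index values are illustrative only.
train = pd.DataFrame(
    {"age": [22, 35, 58], "fare": [7.25, 71.3, 8.05], "survived": [0, 1, 0]},
    index=[10, 11, 12],
)

target_name = "survived"  # assumed name of the label column

# Taking the column out of the frame keeps the train index and
# names the Series after the target column.
target = train[target_name].astype("int")  # use "float" for regression
train = train.drop(columns=[target_name])

# Sanity checks mirroring the three requirements.
assert target.index.equals(train.index)
assert target.name == target_name
assert target.dtype.kind == "i"
```

Extracting the target from the training frame itself is one simple way to guarantee that the indexes line up; a Series built separately would need its index set explicitly.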

Also, if you're dealing with a classification problem, the fitted target encoder (a fitted LabelEncoder Python object) must be saved as "target_encoder.obj" in the "save" directory.
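
A possible way to produce that file is sketched below. It assumes scikit-learn's `LabelEncoder` and assumes the object is serialized with `pickle`; the serialization format is not stated in this thread, so treat that part as a guess to check against the docs. The labels and the Series name are invented for the example.

```python
import os
import pickle

import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Hypothetical raw string labels, indexed like the training set.
raw_labels = pd.Series(["cat", "dog", "cat"], index=[10, 11, 12], name="species")

# Fit the encoder and build the integer-encoded target,
# preserving the original index and name.
encoder = LabelEncoder()
target = pd.Series(
    encoder.fit_transform(raw_labels),
    index=raw_labels.index,
    name=raw_labels.name,
)

# Assumption: the encoder is stored with pickle; only the directory
# and file name come from the comment above.
os.makedirs("save", exist_ok=True)
with open(os.path.join("save", "target_encoder.obj"), "wb") as f:
    pickle.dump(encoder, f)
```

Keeping the fitted encoder around makes it possible to map integer predictions back to the original class names with `encoder.inverse_transform`.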

@AxeldeRomblay
Owner

I will add more details to the docs tomorrow! Thanks for reporting this issue!

@ilyes495 ilyes495 closed this as completed Aug 2, 2017
AxeldeRomblay added a commit that referenced this issue Aug 2, 2017
add examples for Optimiser
add details for df format (issue "dict target #34")