-
Notifications
You must be signed in to change notification settings - Fork 247
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update the Dataset Object Documentation #500
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I really like the writing style!
Some comments:
- I didn't comment on wordings
- I think we need to show more functionalities about the dataset, for example functions like copy, from_numpy, datetime parsing, passing labels as series, and more.
- We can take inspiration from a not-so-related library which in my opinion has a great quickstart:
https://numpy.org/doc/stable/user/quickstart.html
74609a6
to
0b4f0df
Compare
- features | ||
List of column names. This is the features that are passed to the model. If not defined, columns not defined as something else is considered a feature. | ||
- cat_features | ||
List of column names. A subset of the features. Categorical features normally require some preprocessing before being passed to the model. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
List of column names. A subset of the features. Categorical features normally require some preprocessing before being passed to the model. | |
List of column names. A subset of the features. Categorical features normally require some preprocessing before being passed to the model. If not specified, the categorical features are inferred automatically from the data itself. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And also we should line here to the categorical inference heuristic detailed below.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See my general note above about phrasing
List of column names. This is the features that are passed to the model. If not defined, columns not defined as something else is considered a feature. | ||
- cat_features | ||
List of column names. A subset of the features. Categorical features normally require some preprocessing before being passed to the model. | ||
- label |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actually, when reading the doc I think we should rename the label parameter to target.
What do you guys think?
@shir22 @noamzbr @benisraeldan @matanper @nirhutnik @JKL98ISR
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tend to agree that target feels a bit smoother
List of column names. This is the features that are passed to the model. If not defined, columns not defined as something else is considered a feature. | ||
- cat_features | ||
List of column names. A subset of the features. Categorical features normally require some preprocessing before being passed to the model. | ||
- label |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tend to agree that target feels a bit smoother
- features | ||
List of column names. This is the features that are passed to the model. If not defined, columns not defined as something else is considered a feature. | ||
- cat_features | ||
List of column names. A subset of the features. Categorical features normally require some preprocessing before being passed to the model. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
See my general note above about phrasing
…dc-484-dataset-object-documentation
Update to user guide Small updates to readme and index
* 0.2.0 version bump * docs fixes * Fix PerfectModel when data have nulls in label (#526) * - Adding iris to the datasets section (#530) - changing quickstart to use iris from the datasets section * Update the Dataset Object Documentation (#500) Update to user guide Small updates to readme and index * version bump Co-authored-by: matanper <matan@deepchecks.com> Co-authored-by: DBI <42312361+benisraeldan@users.noreply.github.com>
Reference Issues/PRs
Resolves #484