Skip to content

Documentation mismatch about categorical variable handling in Decision Trees #18105

@hiteshsom

Description

@hiteshsom

The below link specifies that the of decsion trees are

  • Able to handle both numerical and categorical data. Other techniques are usually specialised in analysing datasets that have only one type of variable. See algorithms for more information.

image

But if you click on more in the above point it takes you to page
https://scikit-learn.org/stable/modules/tree.html#tree-algorithms

and there is a sentence saying: scikit-learn uses an optimised version of the CART algorithm; however, scikit-learn implementation does not support categorical variables for now.

image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions