Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vectorizers to accept Pandas Series as input #4811

Conversation

shaswat-indian
Copy link
Contributor

Resolves #3403

This PR adds support for using pandas.Series as an input to TfidfVectorizer, HashingVectorizer and CountVectorizer.

raydouglass and others added 4 commits June 7, 2022 11:25
Pin max version of `cuda-python` to `11.7.0`

This is a back port of rapidsai#4793.

Authors:
   - Jordan Jacobelli (https://github.com/Ethyling)

Approvers:
…into fea_vectorizer_pd_series_support

� Conflicts:
�	CHANGELOG.md
�	conda/environments/cuml_dev_cuda11.0.yml
�	conda/environments/cuml_dev_cuda11.2.yml
�	conda/environments/cuml_dev_cuda11.4.yml
�	conda/environments/cuml_dev_cuda11.5.yml
@shaswat-indian shaswat-indian requested a review from a team as a code owner July 13, 2022 00:40
@github-actions github-actions bot added the Cython / Python Cython or Python issue label Jul 13, 2022
@beckernick beckernick added feature request New feature or request 3 - Ready for Review Ready for review by team non-breaking Non-breaking change labels Jul 13, 2022
@shaswat-indian
Copy link
Contributor Author

rerun tests

@shaswat-indian
Copy link
Contributor Author

rerun tests

@caryr35 caryr35 added this to PR-WIP in v22.08 Release via automation Jul 19, 2022
@caryr35 caryr35 moved this from PR-WIP to PR-Needs review in v22.08 Release Jul 19, 2022
v22.08 Release automation moved this from PR-Needs review to PR-Reviewer approved Jul 29, 2022
@dantegd
Copy link
Member

dantegd commented Jul 29, 2022

@gpucibot merge

@rapids-bot rapids-bot bot merged commit ddf620f into rapidsai:branch-22.08 Jul 29, 2022
v22.08 Release automation moved this from PR-Reviewer approved to Done Jul 29, 2022
jakirkham pushed a commit to jakirkham/cuml that referenced this pull request Feb 27, 2023
Resolves rapidsai#3403

This PR adds support for using `pandas.Series` as an input to `TfidfVectorizer`, `HashingVectorizer` and `CountVectorizer`.

Authors:
  - Shaswat Anand (https://github.com/shaswat-indian)
  - Ray Douglass (https://github.com/raydouglass)

Approvers:
  - Dante Gama Dessavre (https://github.com/dantegd)

URL: rapidsai#4811
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3 - Ready for Review Ready for review by team Cython / Python Cython or Python issue feature request New feature or request non-breaking Non-breaking change
Projects
No open projects
Development

Successfully merging this pull request may close these issues.

[FEA] TfidfVectorizer to accept Pandas Series as input
4 participants