Skip to content

Fix bow model#363

Merged
cristian-tamblay merged 2 commits into
developfrom
fix/bow-model
Oct 30, 2025
Merged

Fix bow model#363
cristian-tamblay merged 2 commits into
developfrom
fix/bow-model

Conversation

@Irozuku
Copy link
Copy Markdown
Collaborator

@Irozuku Irozuku commented Oct 29, 2025

This pull request updates the tokenization logic in the bag-of-words text classification model to ensure that all input columns are removed after tokenization, rather than just the "text" column. This change improves compatibility with datasets that may have different or multiple column names.

Improvements to dataset preprocessing:

  • Updated the fit method in bow_text_classification_model.py to remove all input columns after tokenization by passing x.column_names to the remove_columns argument.
  • Updated the predict method in bow_text_classification_model.py to similarly remove all input columns after tokenization.

@cristian-tamblay cristian-tamblay merged commit 3d74d0f into develop Oct 30, 2025
18 checks passed
@cristian-tamblay cristian-tamblay deleted the fix/bow-model branch October 30, 2025 17:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants