📦 Bump pypi:scikit-learn:0.24.0 from 0.24.0 to 1.5.0 #21
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Lineaje has automatically created this pull request to resolve the following CVEs:
TfidfVectorizer, specifically in versions up to and including 1.4.1.post1, which
was fixed in version 1.5.0. The vulnerability arises from the unexpected storage
of all tokens present in the training data within the
stop_words_
attribute,rather than only storing the subset of tokens required for the TF-IDF technique
to function. This behavior leads to the potential leakage of sensitive
information, as the
stop_words_
attribute could contain tokens that were meantto be discarded and not stored, such as passwords or keys. The impact of this
vulnerability varies based on the nature of the data being processed by the
vectorizer.
You can merge this PR once the tests pass and the changes are reviewed.
Thank you for reviewing the update! 🚀