This repository was archived by the owner on Apr 12, 2026. It is now read-only.

chore(deps): bump sentence-transformers from 2.7.0 to 3.0.0 #132

Merged
Daethyra merged 1 commit into streamlit from dependabot/pip/sentence-transformers-3.0.0
Jun 7, 2024

Conversation

@dependabot dependabot bot commented on behalf of github Jun 3, 2024

Bumps sentence-transformers from 2.7.0 to 3.0.0.

Release notes

Sourced from sentence-transformers's releases.

v3.0.0 - Sentence Transformer Training Refactor; new similarity methods; hyperparameter optimization; 50+ datasets release

This release consists of a major refactor that overhauls the training approach (introducing multi-gpu training, bf16, loss logging, callbacks, and much more), adds convenient similarity and similarity_pairwise methods, adds extra keyword arguments, introduces Hyperparameter Optimization, and includes a massive reformatting and release of 50+ datasets for training embedding models. In total, this is the largest Sentence Transformers update since the project was first created.
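
The two similarity methods named above differ in which pairs they score. The following is a minimal plain-Python sketch of that distinction, assuming cosine similarity as the scoring function; the helper names `cosine`, `all_pairs` and `pairwise` are hypothetical and are not the library's API, which operates on embedding tensors produced by a model.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def all_pairs(left: Sequence[Vector], right: Sequence[Vector]) -> List[List[float]]:
    """Score every left vector against every right vector (a len(left) x len(right) grid)."""
    return [[cosine(a, b) for b in right] for a in left]


def pairwise(left: Sequence[Vector], right: Sequence[Vector]) -> List[float]:
    """Score only the aligned pairs: left[0] with right[0], left[1] with right[1], ..."""
    if len(left) != len(right):
        raise ValueError("pairwise scoring needs the same number of vectors on both sides")
    return [cosine(a, b) for a, b in zip(left, right)]


# Toy 2-dimensional "embeddings", made up for illustration only.
left = [[1.0, 0.0], [0.0, 1.0]]
right = [[1.0, 0.0], [1.0, 1.0]]

print(all_pairs(left, right))  # 2 x 2 grid of scores
print(pairwise(left, right))   # 2 scores, one per aligned pair
```

The pairwise result is the diagonal of the all-pairs grid, so it is the cheaper choice when only aligned pairs matter.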

Install this version with

# Full installation:
pip install sentence-transformers[train]==3.0.0

# Inference only:
pip install sentence-transformers==3.0.0

Sentence Transformer training refactor (#2449)

The v3.0 release centers around this huge modernization of the training approach for SentenceTransformer models. Whereas training before v3.0 used to be all about InputExample, DataLoader and model.fit, the new training approach relies on 5 new components. You can learn more about these components in our Training and Finetuning Embedding Models with Sentence Transformers v3 blogpost. Additionally, you can read the new Training Overview, check out the Training Examples, or read this summary:

  1. Dataset: A training Dataset or DatasetDict. This class is much better suited to sharing and efficient modification than lists/DataLoaders of InputExample instances. A Dataset can contain multiple text columns that will be fed in order to the corresponding loss function. So, if the loss expects (anchor, positive, negative) triplets, then your dataset should also have 3 columns. The names of these columns are irrelevant. If there is a "label" or "score" column, it is treated separately, and used as the labels during training. A DatasetDict can be used to train with multiple datasets at once, e.g.:
    DatasetDict({
        multi_nli: Dataset({
            features: ['premise', 'hypothesis', 'label'],
            num_rows: 392702
        })
        snli: Dataset({
            features: ['snli_premise', 'hypothesis', 'label'],
            num_rows: 549367
        })
        stsb: Dataset({
            features: ['sentence1', 'sentence2', 'label'],
            num_rows: 5749
        })
    })
    When a DatasetDict is used, the loss parameter to the SentenceTransformerTrainer must also be a dictionary with these dataset keys, e.g.:
    {
        'multi_nli': SoftmaxLoss(...),
        'snli': SoftmaxLoss(...),
        'stsb': CosineSimilarityLoss(...),
    }
  2. Loss Function: A loss function, or a dictionary of loss functions as described above. These loss functions do not require changes compared to before this PR.
  3. Training Arguments: A SentenceTransformerTrainingArguments instance, a subclass of TrainingArguments. This powerful class controls the specific details of the training.
  4. Evaluator: An optional SentenceEvaluator instance. Unlike before, models can now be evaluated on an evaluation dataset with some loss function, with a SentenceEvaluator instance, or both.
  5. Trainer

... (truncated)
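
The dataset conventions summarized in the list above can be sketched in plain Python. This is an illustration under stated assumptions, not library code: `split_columns` and `check_loss_keys` are hypothetical helpers, the datasets are represented only by their column names, and the losses by their class names as strings.

```python
from typing import Dict, List, Optional, Tuple

# Column names that the release notes say are treated as labels.
LABEL_COLUMNS = ("label", "score")


def split_columns(features: List[str]) -> Tuple[List[str], Optional[str]]:
    """Separate the text inputs (kept in their original order) from the label column, if any."""
    inputs = [name for name in features if name not in LABEL_COLUMNS]
    label = next((name for name in features if name in LABEL_COLUMNS), None)
    return inputs, label


def check_loss_keys(datasets: Dict[str, List[str]], losses: Dict[str, str]) -> None:
    """Raise ValueError unless the loss dictionary has exactly the dataset keys."""
    missing = sorted(set(datasets) - set(losses))
    extra = sorted(set(losses) - set(datasets))
    if missing or extra:
        raise ValueError(f"missing losses for {missing}, unexpected losses for {extra}")


# Column layouts copied from the DatasetDict example in the release notes.
datasets = {
    "multi_nli": ["premise", "hypothesis", "label"],
    "snli": ["snli_premise", "hypothesis", "label"],
    "stsb": ["sentence1", "sentence2", "label"],
}
# Loss class names as plain strings (stand-ins for real loss objects).
losses = {
    "multi_nli": "SoftmaxLoss",
    "snli": "SoftmaxLoss",
    "stsb": "CosineSimilarityLoss",
}

check_loss_keys(datasets, losses)
for name, features in datasets.items():
    inputs, label = split_columns(features)
    print(f"{name}: inputs={inputs} label={label} loss={losses[name]}")
```

Because column order rather than column name decides which input goes where, a dataset for a triplet loss would list its three text columns in (anchor, positive, negative) order whatever they are called.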

Commits
  • ba908be Release v3.0.0
  • e55a6d1 Merge v3.0 pre-release into master, prepare for full v3.0 release (#2685)
  • 85890d5 Update training overview docs based on the blogpost reviews
  • 946a97d Add missing docstrings arguments for Cached... losses
  • 649a31c Add missing subset in quora-duplicates
  • 2f89fd6 Link directly to the 2DMSE preprint
  • 3f5dccb Update phrasing on local_files_only docstring
  • 403d188 Link to the yet-to-be-released release notes instead
  • a373931 Docs: pairwise_cosine_similarity -> pairwise_similarity
  • 24bee09 Fix formatting of Python modules
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

@dependabot dependabot bot added the dependencies label Jun 3, 2024
@Daethyra Daethyra added the wontfix label Jun 4, 2024
@dependabot dependabot bot force-pushed the dependabot/pip/sentence-transformers-3.0.0 branch 3 times, most recently from 3c11dbf to 81b40d9 June 7, 2024 02:10
Bumps [sentence-transformers](https://github.com/UKPLab/sentence-transformers) from 2.7.0 to 3.0.0.
- [Release notes](https://github.com/UKPLab/sentence-transformers/releases)
- [Commits](huggingface/sentence-transformers@v2.7.0...v3.0.0)

---
updated-dependencies:
- dependency-name: sentence-transformers
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot force-pushed the dependabot/pip/sentence-transformers-3.0.0 branch from 81b40d9 to 9257720 June 7, 2024 02:31
@Daethyra Daethyra enabled auto-merge June 7, 2024 02:31
@Daethyra Daethyra merged commit 603a64e into streamlit Jun 7, 2024
@dependabot dependabot bot deleted the dependabot/pip/sentence-transformers-3.0.0 branch June 7, 2024 02:34
