Skip to content
This repository was archived by the owner on Jun 4, 2025. It is now read-only.

Conversation

@natuan
Copy link

@natuan natuan commented Oct 4, 2021

This change is to use distillation modifier in order to update the loss for distillation.
Requires https://github.com/neuralmagic/sparseml/pull/398/files.

@natuan natuan requested review from a team, bfineran, markurtz, mgoin and spacemanidol October 4, 2021 03:50
@natuan natuan changed the title Use distillation modifier from SparseML Use distillation modifier from SparseML for QA task Oct 4, 2021
@markurtz markurtz merged commit 7e13e7d into master Oct 8, 2021
KSGulin pushed a commit that referenced this pull request Mar 9, 2022
* Use distillation modifier from SparseML

* Move teacher model's logic to modifier

Co-authored-by: Mark Kurtz <mark@neuralmagic.com>
@dbogunowicz dbogunowicz deleted the distill_modifier branch December 5, 2023 10:26
bfineran pushed a commit that referenced this pull request Jun 5, 2024
* Cohere Model Release (#1)

Cohere Model Release

* Remove unnecessary files and code (#2)

Some cleanup

* Delete cohere-model directory (#3)

* Make Fix (#5)

* Pr fixes (#6)

* fixes for pr

* pr fixes for the format

* pr fixes for the format

* src/transformers/models/auto/tokenization_auto.py

* Tokenizer test (#8)

* tokenizer test

* format fix

* Adding Docs and other minor changes (#7)

* Add modeling tests (#9)

* Smol Fix (#11)

* tokenization tests are fixed

* format fixes

* fix pr doc tests

* fix pr doc tests

* fix pr doc tests

* fix pr style check

* small changes in cohere.md

* FIX: Address final comments for transformers integration (#13)

* fix modeling final nits and add proper test file

* for now leave empty tests

* add integration test

* push new test

* fix modeling cohere (#14)

* Update chat templates to use the new API (#15)

---------

Co-authored-by: ahmetustun <ahmetustun89@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants