New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Sequential beam search(a.k.a Low-memory beam search) #26304
Commits on Dec 22, 2023
Commits on Jan 3, 2024
-
update the logger message with accordant weights_file_name (huggingfa…
…ce#28181) Co-authored-by: yudong.lin <yudong.lin@funplus.com>
Commits on Jan 4, 2024
-
[
Llava
] Fix llava index errors (huggingface#28032)* fix llava index errors * forward contrib credits from original implementation and fix * better fix * final fixes and fix all tests * fix * fix nit * fix tests * add regression tests --------- Co-authored-by: gullalc <gullalc@users.noreply.github.com>
-
[
Awq
] Enable the possibility to skip quantization for some target m……odules (huggingface#27950) * v1 * add docstring * add tests * add awq 0.1.8 * oops * fix test
-
-
small typo (huggingface#28229)
Update modeling_utils.py
-
Update docs around mixing hf scheduler with deepspeed optimizer (hugg…
…ingface#28223) update docs around mixing hf scheduler with deepspeed optimizer
-
Fix trainer saving safetensors: metadata is None (huggingface#28219)
* Update trainer.py * format
-
fix bug:divide by zero in _maybe_log_save_evaluate() (huggingface#28251)
Co-authored-by: liujizhong1 <liujizhong1@xiaomi.com>
-
[Whisper] Fix errors with MPS backend introduced by new code on word-…
…level timestamps computation (huggingface#28288) * Update modeling_whisper.py to support MPS backend Fixed some issue with MPS backend. First, the torch.std_mean is not implemented and is not scheduled for implementation, while the single torch.std and torch.mean are. Second, MPS backend does not support float64, so it can not cast from float32 to float64. Inverting the double() when the matrix is in the cpu fixes the issue while should not change the logic. * Found another instruction in modeling_whisper.py not implemented byor MPS After a load test, where I transcribed a 2 hours audio file, I got into a branch that did not fix in the previous commit. Similar fix, where the torch.std_mean is changed into torch.std and torch.mean * Update modeling_whisper.py removed trailing white spaces Removed trailing white spaces * Update modeling_whisper.py to use is_torch_mps_available() Using is_torch_mps_available() instead of capturing the NotImplemented exception * Update modeling_whisper.py sorting the import block Sorting the utils import block * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Bump tj-actions/changed-files from 22.2 to 41 in /.github/workflows (h…
…uggingface#28311) Bumps [tj-actions/changed-files](https://github.com/tj-actions/changed-files) from 22.2 to 41. - [Release notes](https://github.com/tj-actions/changed-files/releases) - [Changelog](https://github.com/tj-actions/changed-files/blob/main/HISTORY.md) - [Commits](tj-actions/changed-files@v22.2...v41) --- updated-dependencies: - dependency-name: tj-actions/changed-files dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Commits on Jan 10, 2024
Commits on Jan 16, 2024
-
fix error with test_modeling_whisper.py
Reason is because encoder_outputs from whisper is None in this test
-
-