feat: Sequential beam search(a.k.a Low-memory beam search) #26304

…ce#28181) Co-authored-by: yudong.lin <yudong.lin@funplus.com>

* fix llava index errors * forward contrib credits from original implementation and fix * better fix * final fixes and fix all tests * fix * fix nit * fix tests * add regression tests --------- Co-authored-by: gullalc <gullalc@users.noreply.github.com>

…odules (huggingface#27950) * v1 * add docstring * add tests * add awq 0.1.8 * oops * fix test

Update modeling_utils.py

…ingface#28223) update docs around mixing hf scheduler with deepspeed optimizer

* Update trainer.py * format

Co-authored-by: liujizhong1 <liujizhong1@xiaomi.com>

…level timestamps computation (huggingface#28288) * Update modeling_whisper.py to support MPS backend Fixed some issue with MPS backend. First, the torch.std_mean is not implemented and is not scheduled for implementation, while the single torch.std and torch.mean are. Second, MPS backend does not support float64, so it can not cast from float32 to float64. Inverting the double() when the matrix is in the cpu fixes the issue while should not change the logic. * Found another instruction in modeling_whisper.py not implemented byor MPS After a load test, where I transcribed a 2 hours audio file, I got into a branch that did not fix in the previous commit. Similar fix, where the torch.std_mean is changed into torch.std and torch.mean * Update modeling_whisper.py removed trailing white spaces Removed trailing white spaces * Update modeling_whisper.py to use is_torch_mps_available() Using is_torch_mps_available() instead of capturing the NotImplemented exception * Update modeling_whisper.py sorting the import block Sorting the utils import block * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

…uggingface#28311) Bumps [tj-actions/changed-files](https://github.com/tj-actions/changed-files) from 22.2 to 41. - [Release notes](https://github.com/tj-actions/changed-files/releases) - [Changelog](https://github.com/tj-actions/changed-files/blob/main/HISTORY.md) - [Commits](tj-actions/changed-files@v22.2...v41) --- updated-dependencies: - dependency-name: tj-actions/changed-files dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Reason is because encoder_outputs from whisper is None in this test

Commits on Jan 10, 2024

Merge branch 'main' into fix_issue_22639

gante committed Jan 10, 2024

Copy the full SHA

6054c33 View commit details

Browse the repository at this point in the history

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Sequential beam search(a.k.a Low-memory beam search) #26304

feat: Sequential beam search(a.k.a Low-memory beam search) #26304

Commits on Dec 22, 2023

Commits on Jan 3, 2024

Commits on Jan 4, 2024

Commits on Jan 10, 2024

Commits on Jan 16, 2024