issues Search Results · repo:huggingface/transformers language:Python
Filter by
17k results
(95 ms)17k results
inhuggingface/transformers (press backspace or delete to remove)System Info
Ascend NPU transformers =4.50.0 torch 2.1
Who can help?
No response
Information
- [ ] The official example scripts
- [ ] My own modified scripts
Tasks
- [ ] An officially supported ...
bug
jiaqiw09
- 1
- Opened 5 hours ago
- #38468
System Info
Ran into this bug https://github.com/huggingface/transformers/issues/38329. Tried installing from main to get access to
this fix https://github.com/huggingface/transformers/pull/38376, but ...
bug
mattdornfeld
- 2
- Opened 8 hours ago
- #38464
System Info
transformers==4.51.3 Ascend 910B torch==2.5.1 torch-npu==2.5.1
Who can help?
No response
Information
- [ ] The official example scripts
- [ ] My own modified scripts
Tasks
- [ ...
bug
Enlion91
- 1
- Opened 10 hours ago
- #38462
System Info
torch.get_default_device() is not an official PyTorch API (even in 2.2), but it’s being called inside modeling_utils.py
in version 4.52.1.
get_torch_context_manager_or_global_device() function ...
bug
shashank-shivam
- 2
- Opened 16 hours ago
- #38457
System Info
When trying to quantize w/ torchao on 4.52.3 and not having bits and bytes installed I am getting:
python ao/prep_model.py --model_name facebook/opt-125m --quant_type fp8 --granularity ...
bug
drisspg
- 1
- Opened 17 hours ago
- #38454
Feature request
In loss_utils.py, logits are upcasted for float32 for some losses. This can waste memory for cases where certain labels
are ignore_index. This is especially true for fine tuning cases ...
Feature request
harshit2997
- 1
- Opened 18 hours ago
- #38452
System Info
File: modeling_qwen2_5_omni.py
def _apply_rotary_pos_emb_flashatt(self, tensor: torch.Tensor, freqs: torch.Tensor) - torch.Tensor:
tensor_ = tensor.float()
cos = freqs.cos() ...
bug
HarryHsing
- 2
- Opened 19 hours ago
- #38451
Feature request
Hi 🤗 team and contributors,
I m currently exploring ways to extend the transformers library to support temporal grounding — the task of identifying
a [start, end] timestamp segment in ...
Feature request
mreraser
- 1
- Opened 21 hours ago
- #38450
def fixed_cross_entropy(source, target, num_items_in_batch: int = None, ignore_index: int = -100, **kwargs): I check the
shape of the inputs and find follows: In [1]: logits.shape Out[1]: torch.Size([4, ...
SHIFTTTTTTTT
- 2
- Opened 21 hours ago
- #38448
System Info
Package Version Editable project location
accelerate 1.7.0 aiohappyeyeballs 2.4.4 aiohttp 3.11.9 aiosignal 1.3.1 altair 5.5.0 annotated-types 0.7.0
anyio 4.6.2.post1 argon2-cffi 23.1.0 argon2-cffi-bindings ...
bug
qsuzer
- 1
- Opened 23 hours ago
- #38442

Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.
Learn how you can use GitHub Issues to plan and track your work.
Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub IssuesProTip!
Restrict your search to the title by using the in:title qualifier.