Error when fine-tuning seaco_paraformer: seaco_id = samples[0]["seaco_id"] IndexError: list index out of range #2434

@1xuanxia

Notice: To help us resolve issues more efficiently, please follow the issue template and fill in the details.

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

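When fine-tuning seaco_paraformer, I hit the error below. I am already using the latest seaco_paraformer model and the latest finetune.sh code, and the training data format is correct, but the error still occurs. funasr=1.2.0.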

Code

  File "/opt/asr_finetune/FunASR/examples/industrial_data_pretraining/seaco_paraformer/../../../funasr/bin/train_ds.py", line 56, in main_hydra
    main(**kwargs)
  File "/opt/asr_finetune/FunASR/examples/industrial_data_pretraining/seaco_paraformer/../../../funasr/bin/train_ds.py", line 173, in main
    trainer.train_epoch(
  File "/root/anaconda3/envs/funasr/lib/python3.10/site-packages/funasr/train_utils/trainer_ds.py", line 578, in train_epoch
    for batch_idx, batch in enumerate(dataloader_train):
  File "/root/anaconda3/envs/funasr/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 701, in __next__
    data = self._next_data()
  File "/root/anaconda3/envs/funasr/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 757, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
  File "/root/anaconda3/envs/funasr/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 55, in fetch
    return self.collate_fn(data)
  File "/root/anaconda3/envs/funasr/lib/python3.10/site-packages/funasr/datasets/audio_datasets/datasets.py", line 215, in collator
    seaco_id = samples[0]["seaco_id"]
IndexError: list index out of range
E0315 13:47:12.072000 618603 site-packages/torch/distributed/elastic/multiprocessing/api.py:869] failed (exitcode: 1) local_rank: 0 (pid: 618658) of binary: /root/anaconda3/envs/funasr/bin/python
Traceback (most recent call last):
  File "/root/anaconda3/envs/funasr/bin/torchrun", line 33, in <module>
    sys.exit(load_entry_point('torch==2.5.1', 'console_scripts', 'torchrun')())
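
For context, an `IndexError` on `samples[0]` means the collator received an empty `samples` list. The sketch below reproduces that failure mode in isolation and shows one possible guard. It is not FunASR's code: `naive_collate` and `collate_with_guard` are hypothetical names, only the `samples[0]["seaco_id"]` access is taken from the traceback, and the reason the list ends up empty (for example, every sample in the batch being filtered out or failing to load) is an assumption that this report does not confirm.

```python
from typing import Any, Dict, List


def naive_collate(samples: List[Dict[str, Any]]) -> Dict[str, Any]:
    # Same access pattern as the failing line in the traceback:
    # indexing samples[0] raises IndexError when samples is empty.
    return {"seaco_id": samples[0]["seaco_id"]}


def collate_with_guard(samples: List[Dict[str, Any]]) -> Dict[str, Any]:
    # Hypothetical guard: fail early with a descriptive message
    # instead of a bare "list index out of range".
    if not samples:
        raise ValueError(
            "collator received an empty batch; every sample may have been "
            "filtered out or failed to load (assumption, check the data list)"
        )
    return {"seaco_id": samples[0]["seaco_id"], "batch_size": len(samples)}


if __name__ == "__main__":
    try:
        naive_collate([])
    except IndexError as err:
        print("naive collator:", err)

    try:
        collate_with_guard([])
    except ValueError as err:
        print("guarded collator:", err)

    print(collate_with_guard([{"seaco_id": 0}]))
```

A guard like this does not fix the underlying cause; it only makes the empty batch visible earlier so the data list and any length or filtering settings can be inspected.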

What have you tried?

What's your environment?

  • OS (e.g., Linux):
  • FunASR Version (e.g., 1.0.0):
  • ModelScope Version (e.g., 1.11.0):
  • PyTorch Version (e.g., 2.0.0):
  • How you installed funasr (pip, source):
  • Python version:
  • GPU (e.g., V100M32)
  • CUDA/cuDNN version (e.g., cuda11.7):
  • Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1)
  • Any other relevant information:

Labels: question (Further information is requested)
