Skip to content

data.helper 无法加载? #11

@liudonglei

Description

@liudonglei

data.helper 无法加载?

[Global Rank 0] open file /DDD/CodeExercise-Python-27k/CodeExercise-Python-27k.jsonl
[Global Rank 0]shape of cur train dataset: (4116, 4097)
[Global Rank 0]shape of cur valid dataset: (216, 4097)
[Global Rank 0]num tokens: [14803925]
[Global Rank 0]effective token rate: [0.8343119810602782]

train loss weights in rank 0: [1.0]
valid loss weights in rank 0: [1.0]
common denomination factor for CE loss in rank 0: 1
train sample weights in rank 0: [1.0]
valid sample weights in rank 0: [1.0]
global train num in rank 0: 4116
global valid num in rank 0: 216
loss weight of train dataset 0 before update in rank 0: 95.0
loss weight of train dataset 0 after update in rank 0: 1.0
Traceback (most recent call last):
File "/DDD/MFTCoder/mft_peft_hf/src/pefts/mft_accelerate.py", line 382, in
main()
File "/DDD/MFTCoder/mft_peft_hf/src/pefts/mft_accelerate.py", line 246, in main
train_dataset, valid_dataset = load_dataset_from_jsonl(args, shard_data=True, world_size=args.world_size,
File "/DDD/MFTCoder/mft_peft_hf/src/pefts/../data/gpt2_multi_task_dataset.py", line 343, in load_dataset_from_jsonl
blending_train_dataset = GPT2BlendableDataset(all_train_datasets, train_sample_weights, global_train_num, local_train_num)
File "/DDD/MFTCoder/mft_peft_hf/src/pefts/../data/gpt2_multi_task_dataset.py", line 105, in init
from data import helpers
ImportError: cannot import name 'helpers' from 'data' (/DDD/MFTCoder/mft_peft_hf/src/pefts/../data/init.py)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions