Training loss not showing with trainer #36102

Closed
4 tasks
mwnthainarzary opened this issue Feb 8, 2025 · 2 comments

@mwnthainarzary

System Info

Python 3.11.11
transformers 4.48.2

Who can help?

@muellerzr
@SunMarc

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

from transformers import Trainer, TrainingArguments

# Define the TrainingArguments for fine-tuning
training_args = TrainingArguments(
    output_dir='/content/drive/MyDrive/Legal_Dataset/BartlargeFineTuned/',
    num_train_epochs=10,
    per_device_train_batch_size=10,
    gradient_accumulation_steps=8,
    evaluation_strategy="epoch",
    save_total_limit=1,
    save_steps=1000,
    learning_rate=1e-3,
    do_train=True,
    do_eval=True,
    remove_unused_columns=False,
    push_to_hub=False,
    report_to='tensorboard',
    load_best_model_at_end=False,
    lr_scheduler_type="cosine_with_restarts",
    warmup_steps=100,
    weight_decay=0.01,
    logging_dir='/content/drive/MyDrive/Legal_Dataset/BartlargeFineTuned/',
    logging_steps=200,
)

# Create a data collator for sequence-to-sequence tasks
# (MyDataCollatorForSeq2Seq is a custom collator defined elsewhere in my code)
data_collator = MyDataCollatorForSeq2Seq(
    tokenizer=tokenizer,
    model=model,
    padding=False,
    max_length=80,
    label_pad_token_id=tokenizer.pad_token_id,
)

# Create the Trainer (custom_optimizer is defined elsewhere in my code)
trainer = Trainer(
    model=model,
    args=training_args,
    data_collator=data_collator,
    train_dataset=train_dataset,
    eval_dataset=validation_dataset,
    optimizers=(custom_optimizer, None),
)

trainer.train()

Expected behavior

I trained the model for 10 epochs, but in every epoch I saw only the validation loss, not the training loss. Please help.

[Image: per-epoch training output showing only the validation loss]

@neonwatty

neonwatty commented Feb 12, 2025

In your TrainingArguments, perhaps try:

  • examining the output in your logging_dir to confirm what is actually being logged
  • switching evaluation_strategy to eval_strategy (it looks like evaluation_strategy is deprecated)
  • lowering your logging_steps - at its current value of 200 it may be too large given the size of your dataset (you might try evaluating every X steps instead of every epoch to triangulate the right value); see the sketch after this list
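
A minimal sketch of those adjustments, assuming the rest of your setup stays the same (the logging_steps value of 10 below is purely illustrative; pick something smaller than the number of optimizer steps in your run):

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir='/content/drive/MyDrive/Legal_Dataset/BartlargeFineTuned/',
    num_train_epochs=10,
    per_device_train_batch_size=10,
    gradient_accumulation_steps=8,
    eval_strategy="epoch",   # renamed from the deprecated evaluation_strategy
    logging_steps=10,        # illustrative: small enough to be reached during training
    logging_dir='/content/drive/MyDrive/Legal_Dataset/BartlargeFineTuned/',
    report_to='tensorboard',
    # ... remaining arguments unchanged ...
)

With per_device_train_batch_size=10 and gradient_accumulation_steps=8, each optimizer step consumes 80 samples (on a single device), so with logging_steps=200 the first training-loss log would only appear after roughly 16,000 samples; if the whole run is shorter than that, no training loss is ever reported.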


This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.
