@lucas-aixplain
Contributor

Task: M-5542693469

warmup_ratio: float = 0.0
warmup_steps: int = 0
early_stopping_patience: int = 1
lr_scheduler_type: SchedulerType = SchedulerType.LINEAR
Contributor

Does @dataclass_json convert the enum to a string automatically?

Contributor Author

Yes. If we run the following line:

finetune = FinetuneFactory.create(str(uuid.uuid4()), dataset_list, model, hyperparameters=Hyperparameters(lr_scheduler_type=SchedulerType.CONSTANT))

we get the following request body:

{
  "datasets": [
    {
      "datasetId": "65293795abed4d551b86baac",
      "trainPercentage": 100,
      "devPercentage": 0
    }
  ],
  "sourceModelId": "64e615671567f848804985e1",
  "parameters": {
    "hyperparameters": {
      "epochs": 4,
      "train_batch_size": 4,
      "eval_batch_size": 4,
      "learning_rate": 2e-05,
      "generation_max_length": 225,
      "tokenizer_batch_size": 256,
      "gradient_checkpointing": false,
      "gradient_accumulation_steps": 1,
      "max_seq_length": 4096,
      "warmup_ratio": 0.0,
      "warmup_steps": 0,
      "early_stopping_patience": 1,
      "lr_scheduler_type": "constant"
    }
  }
}
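For readers without the SDK at hand, the enum-to-string behaviour can be reproduced with a minimal standard-library sketch. It does not use `dataclasses_json` itself; the `EnumEncoder` and `to_payload` names are hypothetical, the `Hyperparameters` class is cut down to the fields added in this PR, and the `"linear"` value is an assumption (only `"constant"` appears in the request body above).

```python
import json
from dataclasses import asdict, dataclass
from enum import Enum


class SchedulerType(Enum):
    # "constant" matches the request body above; "linear" is an assumed value.
    LINEAR = "linear"
    CONSTANT = "constant"


@dataclass
class Hyperparameters:
    # Only the fields touched by this PR; the real class has more.
    warmup_ratio: float = 0.0
    warmup_steps: int = 0
    early_stopping_patience: int = 1
    lr_scheduler_type: SchedulerType = SchedulerType.LINEAR


class EnumEncoder(json.JSONEncoder):
    """Hypothetical encoder: serialize any Enum member as its value."""

    def default(self, o):
        if isinstance(o, Enum):
            return o.value
        return super().default(o)


def to_payload(hp: Hyperparameters) -> dict:
    # Round-trip through JSON so the result holds plain JSON types only.
    return json.loads(json.dumps(asdict(hp), cls=EnumEncoder))


payload = to_payload(Hyperparameters(lr_scheduler_type=SchedulerType.CONSTANT))
print(payload["lr_scheduler_type"])
```

The point of the sketch is only that the enum member ends up as a plain string in the serialized body, which is what the request body above shows.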

Contributor

Should you include these parameters in the functional test?
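One possible shape for such a check, as a rough sketch rather than the project's actual functional test: a helper that inspects a request body for the fields this PR adds. The `NEW_FIELDS` table and `missing_or_mistyped` name are hypothetical, and the sample body is abridged from the one posted earlier in this thread.

```python
import json

# Abridged from the request body posted earlier in this conversation.
REQUEST_BODY = json.loads("""
{
  "parameters": {
    "hyperparameters": {
      "warmup_ratio": 0.0,
      "warmup_steps": 0,
      "early_stopping_patience": 1,
      "lr_scheduler_type": "constant"
    }
  }
}
""")

# Fields added in this PR and the JSON type each is expected to have.
NEW_FIELDS = {
    "warmup_ratio": float,
    "warmup_steps": int,
    "early_stopping_patience": int,
    "lr_scheduler_type": str,
}


def missing_or_mistyped(body: dict) -> list:
    """Return a list of problems found in the hyperparameters section."""
    hp = body["parameters"]["hyperparameters"]
    problems = []
    for name, expected_type in NEW_FIELDS.items():
        if name not in hp:
            problems.append(f"{name}: missing")
        elif not isinstance(hp[name], expected_type):
            problems.append(f"{name}: expected {expected_type.__name__}")
    return problems


print(missing_or_mistyped(REQUEST_BODY))
```

A real functional test would presumably build the body through `FinetuneFactory.create` instead of a hard-coded sample.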

@thiago-aixplain thiago-aixplain merged commit ab51903 into main Nov 29, 2023
@thiago-aixplain thiago-aixplain deleted the M-5542693469-addLRSchedulerEarlyStopping branch November 29, 2023 14:47
3 participants