Support optimizers with different parameters #96

Merged: 6 commits into alibaba:master on May 31, 2022

Conversation

DavdGao (Collaborator) commented May 20, 2022

  • This PR solves the issue "A problem when using Adam optimizer" #91.
  • Solution
    • Specify the parameters of the local optimizer by adding new parameters under the configs cfg.optimizer and cfg.fedopt.optimizer.
    • get_optimizer is then called as follows:
    optimizer = get_optimizer(model=model, **cfg.optimizer)
  • Example:
    • Taking cfg.optimizer as an example, the original config file is as follows
    # ------------------------------------------------------------------------ #
    # Optimizer related options
    # ------------------------------------------------------------------------ #
    cfg.optimizer = CN(new_allowed=True)

    cfg.optimizer.type = 'SGD'
    cfg.optimizer.lr = 0.1
  • By setting new_allowed=True in cfg.optimizer, we allow users to add new parameters according to the type of their optimizer. For example, to use an optimizer registered as myoptimizer together with its own parameters mylr and mynorm, I only need to write the yaml file as follows, and the new parameters will be added automatically (a minimal sketch of such a get_optimizer is given after this example).
optimizer:
    type: myoptimizer
    mylr: 0.1
    mynorm: 1
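
For illustration, below is a minimal sketch of how a get_optimizer that accepts **cfg.optimizer-style keyword arguments could be implemented. It assumes PyTorch optimizers and a plain dict as the registry of custom optimizers; it is only a sketch, not the exact FederatedScope implementation.

    import torch

    # Hypothetical registry of user-defined optimizers (e.g. 'myoptimizer').
    OPTIMIZER_REGISTRY = {}

    def get_optimizer(model, type='SGD', **kwargs):
        # `type` selects the optimizer class; every remaining key of
        # cfg.optimizer (lr, weight_decay, momentum, or user-defined ones
        # such as mylr/mynorm) is forwarded as a keyword argument.
        if type in OPTIMIZER_REGISTRY:
            opt_cls = OPTIMIZER_REGISTRY[type]
        else:
            opt_cls = getattr(torch.optim, type)  # e.g. 'SGD', 'Adam'
        return opt_cls(model.parameters(), **kwargs)

    # Usage, mirroring the call above:
    #   optimizer = get_optimizer(model=model, **cfg.optimizer)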

joneswong (Collaborator) commented May 20, 2022

I am wondering whether it is necessary to change the exposed parameters. To resolve this issue, why not just change the argument of get_optimizer? For example, let the caller pass cfg.optimizer instead of each of the optional parameters (e.g., weight_decay and momentum), and, in get_optimizer(), pack them into the appropriate kwargs according to the type of the optimizer.

joneswong (Collaborator) commented May 20, 2022

(re: the suggestion above) What is your opinion? @DavdGao @rayrayraykk @yxdyc @xieyxclack

rayrayraykk (Collaborator)

(replying to joneswong's suggestion above) Agreed, we could use a filter to pass the args; the linked helper (not shown here) can be applied to optimizers too.
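
For illustration, such a filter might look like the sketch below, which uses inspect.signature to keep only the keyword arguments accepted by the target optimizer class. The function name filter_kwargs_for is hypothetical and not taken from the repository.

    import inspect

    def filter_kwargs_for(cls, config_dict):
        # Keep only the entries of config_dict that the constructor of `cls`
        # actually accepts; unrelated config keys are silently dropped.
        accepted = inspect.signature(cls.__init__).parameters
        return {k: v for k, v in config_dict.items() if k in accepted}

    # Example: torch.optim.SGD accepts 'momentum' but not 'betas',
    # so 'betas' would be dropped:
    #   sgd_kwargs = filter_kwargs_for(torch.optim.SGD,
    #                                  {'lr': 0.1, 'momentum': 0.9,
    #                                   'betas': (0.9, 0.999)})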

DavdGao (Collaborator, Author) commented May 25, 2022

(replying to joneswong's suggestion above)

The solution is updated accordingly.

DavdGao (Collaborator, Author) commented May 25, 2022

(replying to rayrayraykk's suggestion above)

The solution is updated accordingly.

joneswong (Collaborator)
This configuration mechanism looks cool to me! Could you provide a test case for it?

DavdGao (Collaborator, Author) commented May 31, 2022

(replying to the request for a test case)
Thanks, the unit test has been added.
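
As an illustration of what such a test might check, here is a minimal sketch written against the get_optimizer sketch given earlier in this thread; the actual unit test added in this PR may look different.

    import unittest
    import torch

    class TestGetOptimizer(unittest.TestCase):
        def test_optimizer_specific_params(self):
            model = torch.nn.Linear(4, 2)
            # cfg.optimizer-style dict with an optimizer-specific extra key.
            opt_cfg = {'type': 'Adam', 'lr': 0.01, 'betas': (0.9, 0.99)}
            optimizer = get_optimizer(model=model, **opt_cfg)
            self.assertIsInstance(optimizer, torch.optim.Adam)
            self.assertEqual(optimizer.defaults['lr'], 0.01)
            self.assertEqual(optimizer.defaults['betas'], (0.9, 0.99))

    if __name__ == '__main__':
        unittest.main()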

yxdyc (Collaborator) left a comment
LGTM, please see the inline comments.

Inline comment on:

    dataset_name=self._cfg.data.type,
    fl_local_update_num=self._cfg.federate.local_update_steps,
-   fl_type_optimizer=self._cfg.fedopt.type_optimizer,
+   fl_type_optimizer=self._cfg.fedopt.optimizer.type,
yxdyc (Collaborator) commented:
Should we ensure backward compatibility? That is, both optimizer.grad_clip and grad.grad_clip would be accepted. Otherwise, we may have to exhaustively modify the historical code to ensure that the previous experiments still work correctly.

DavdGao (Collaborator, Author) replied:
  • @yxdyc Thanks, I have checked the historical code and made sure that all unit tests pass.
  • As for gradient clipping, since it is not a common parameter of general optimizers (unlike, e.g., the learning rate), maybe we should consider it an independent operation and separate it from the optimizer (see the sketch below).
  • Since we have set cfg.optimizer = CN(new_allowed=True), our modification also supports optimizer.grad_clip as a parameter for customized optimizers.
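
To illustrate treating gradient clipping as an operation independent of the optimizer, here is a minimal sketch of a local training step in PyTorch; the grad_clip argument is illustrative and not a confirmed config field.

    import torch

    def train_step(model, optimizer, loss_fn, x, y, grad_clip=None):
        # One local update; clipping is applied as its own step, regardless of
        # which optimizer (SGD, Adam, or a registered custom one) is in use.
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        if grad_clip is not None and grad_clip > 0:
            torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=grad_clip)
        optimizer.step()
        return loss.item()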

Inline comment on:

    fl_lr=self._cfg.optimizer.lr,
    batch_size=100)

-   # self.optimizer = get_optimizer(type=self._cfg.fedopt.type_optimizer, model=self.model,lr=self._cfg.fedopt.lr_server)
+   # self.optimizer = get_optimizer(type=self._cfg.fedopt.type_optimizer, model=self.model,lr=self._cfg.fedopt.optimizer.lr)
yxdyc (Collaborator) commented:
The same problem as "optimizer.grad_clip vs. grad.grad_clip" above.

joneswong (Collaborator) left a comment
LGTM

joneswong added the "bug (Something isn't working)" label on May 31, 2022
yxdyc merged commit cf97ccb into alibaba:master on May 31, 2022
xieyxclack linked an issue on Jun 6, 2022 that may be closed by this pull request
Labels: bug (Something isn't working)
Successfully merging this pull request may close these issues: A problem when using Adam optimizer
4 participants