Make tutorial for enabling different learning rates #1183

Open
2 tasks done
dfdazac opened this issue Dec 7, 2022 · 4 comments
Labels
documentation Improvements or additions to documentation

Comments

dfdazac commented Dec 7, 2022

Problem Statement

First of all, thanks for the great work with the library!
It would be very useful to be able to specify different learning rates for different parts of the model. Right now, when running a pipeline, an instance of the optimizer is created by passing all of the model's trainable parameters:

optimizer_instance = optimizer_resolver.make(
    optimizer,
    optimizer_kwargs,
    params=model_instance.get_grad_params(),
)

However, in some cases we might also want to apply per-parameter options, for example:

optim.SGD([
    {'params': model.base.parameters()},
    {'params': model.classifier.parameters(), 'lr': 1e-3}
], lr=1e-2, momentum=0.9)

Describe the solution you'd like

A possible solution could be an optional argument passed when creating the pipeline, e.g. optimizer_params. If it's not provided, the pipeline would default to the current behavior; otherwise, the user could choose different learning rates for modules in a custom model:

optimizer_instance = optimizer_resolver.make(
    optimizer,
    optimizer_kwargs,
    params=optimizer_params if optimizer_params else model_instance.get_grad_params(),
)

Describe alternatives you've considered

I tried getting access to the optimizer via a TrainingCallback, and I considered modifying the learning rate for different modules in the pre_step method:

from pykeen.training.callbacks import TrainingCallback

class MultiLearningRateCallback(TrainingCallback):
    ...

    def pre_step(self, **kwargs):
        # Here we have access to the optimizer via self.optimizer
        ...

The problem is that at this point the optimizer has already been initialized and has been assigned Parameters, which are difficult to map to the original modules.
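
For illustration only (this helper is hypothetical and not part of PyKEEN), the mapping can be attempted with plain torch objects by matching parameter identities against named_parameters(); even with such a helper, regrouping the optimizer's already-constructed param_groups would still have to be done by hand:

from torch import nn, optim


def describe_param_groups(model: nn.Module, optimizer: optim.Optimizer) -> list[list[str]]:
    """Map the parameters in each optimizer param group back to their dotted names in the model."""
    # Parameters are only comparable by identity, so build an id -> name lookup first.
    id_to_name = {id(parameter): name for name, parameter in model.named_parameters()}
    return [
        [id_to_name.get(id(parameter), "<unknown>") for parameter in group["params"]]
        for group in optimizer.param_groups
    ]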

Additional information

No response

Issue Template Checks

  • This is not a bug report (use a different issue template if it is)
  • This is not a question (use the discussions forum instead)
dfdazac added the enhancement (New feature or request) label Dec 7, 2022
cthoyt (Member) commented Dec 7, 2022

I'm hesitant about this because the built-in pipeline is only supposed to cover the simplest use cases. Every addition makes it more difficult to maintain, to document, and to learn. Further, I don't see any obvious, simple way to configure this from a high level.

As an alternative, it's possible to roll your own pipeline that does exactly what you want. I'd suggest checking out https://pykeen.readthedocs.io/en/stable/tutorial/first_steps.html#beyond-the-pipeline for how to do that.
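
For reference, here is a minimal sketch of that route, loosely following the beyond-the-pipeline tutorial. The Nations/TransE choice is just an example, and the entity_representations/relation_representations attributes are assumptions that may be named differently in a custom model:

from torch.optim import SGD

from pykeen.datasets import Nations
from pykeen.models import TransE
from pykeen.training import SLCWATrainingLoop

# Assemble the components that pipeline() would otherwise create for you.
dataset = Nations()
model = TransE(triples_factory=dataset.training)

# Per-parameter options, as in the plain PyTorch example above: entity embeddings
# use the default learning rate, relation embeddings use a smaller one.
optimizer = SGD(
    [
        {"params": model.entity_representations[0].parameters()},
        {"params": model.relation_representations[0].parameters(), "lr": 1e-3},
    ],
    lr=1e-2,
    momentum=0.9,
)

training_loop = SLCWATrainingLoop(
    model=model,
    triples_factory=dataset.training,
    optimizer=optimizer,
)
training_loop.train(triples_factory=dataset.training, num_epochs=5, batch_size=256)

Because the optimizer is constructed outside of the training loop here, any torch.optim per-parameter configuration can be used unchanged.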

dfdazac (Author) commented Dec 8, 2022

Thank you @cthoyt, I understand. I wanted to figure out if there was an alternative, because we are using so many useful parts of the built-in pipeline right now. I'll give it a try; please feel free to close this issue if you think there's nothing further to discuss.

cthoyt (Member) commented Dec 8, 2022

@dfdazac if you create a minimal working example, we would love to include it in the documentation. Do you think you could do the following:

  1. Write 1-2 sentences about why you would want per-parameter options in KGEM training (beyond just more configurability, what's a concrete scenario where this would actually be helpful?)
  2. Give an end-to-end code example, maybe based on the beyond-the-pipeline section, that includes your updates

You could make your own RST document in https://github.com/pykeen/pykeen/tree/master/docs/source/tutorial in a PR that includes this.

cthoyt changed the title from "Enabling different learning rates" to "Make tutorial for enabling different learning rates" Dec 8, 2022
cthoyt added the documentation (Improvements or additions to documentation) label and removed the enhancement (New feature or request) label Dec 8, 2022
mberr (Member) commented Jan 9, 2023

I might be late to the party, but another option (still quite hacky) would be to create a custom subclass of Optimizer and register it with the resolver:

from collections.abc import Iterable

from pykeen.optimizers import optimizer_resolver
from torch import nn, optim


class ModifiedSGD(optim.SGD):
    def __init__(
        self,
        params: Iterable[nn.Parameter],
        custom_lrs: list[tuple[list[nn.Parameter], float]],
        **kwargs,
    ):
        # collect the ids of all parameters that receive a custom learning rate
        custom_param_ids = {id(p) for custom_params, _ in custom_lrs for p in custom_params}
        # everything else stays in the default parameter group
        default_params = [p for p in params if id(p) not in custom_param_ids]
        super().__init__(
            params=[
                {"params": default_params},
                *({"params": custom_params, "lr": custom_lr} for custom_params, custom_lr in custom_lrs),
            ],
            **kwargs,
        )


optimizer_resolver.register(ModifiedSGD)

You can now use this optimizer with the pipeline:

from pykeen.pipeline import pipeline

pipeline(
    optimizer=ModifiedSGD,
    optimizer_kwargs=dict(
        custom_lrs=[(list(model.classifier.parameters()), 1e-3)],
        lr=1e-2,
        momentum=0.9,
    ),
    ...
)
