Gradient Clipping

Support for adaptive gradient clipping (AGC) has also been added, based on this implementation. To enable it, set use_agc=True:

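from gradient_accumulator import GradientAccumulateModel

# Wrap an existing model; use_agc=True turns on adaptive gradient clipping.
model = GradientAccumulateModel(
    accum_steps=4, use_agc=True, clip_factor=0.01, eps=1e-3, inputs=model.input, outputs=model.output
)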

The values shown here for the clip_factor and eps hyperparameters are the defaults.
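
To give a feel for what clip_factor and eps control, here is a minimal pure-Python sketch of the adaptive clipping rule as it is commonly formulated: a gradient is rescaled whenever its norm exceeds clip_factor times the norm of the corresponding weights, with eps acting as a lower bound on the weight norm. The helper names l2_norm and agc_clip_unit are hypothetical and used only for illustration. This is not the code used by GradientAccumulateModel, which may differ in details such as how weights are grouped into units.

```python
import math


def l2_norm(values):
    """Euclidean norm of a flat list of floats."""
    return math.sqrt(sum(v * v for v in values))


def agc_clip_unit(grad, weight, clip_factor=0.01, eps=1e-3):
    """Clip one unit's gradient relative to that unit's weight norm.

    Hypothetical helper: grad and weight are flat lists of equal length.
    """
    # eps keeps zero-initialised weights from forcing the gradient to zero.
    weight_norm = max(l2_norm(weight), eps)
    grad_norm = l2_norm(grad)
    max_norm = clip_factor * weight_norm
    if grad_norm > max_norm:
        # Rescale so the gradient norm equals the allowed maximum.
        scale = max_norm / grad_norm
        return [g * scale for g in grad]
    return list(grad)


weight = [3.0, 4.0]       # norm 5.0, so the allowed gradient norm is 0.05
small_grad = [0.03, 0.0]  # norm 0.03, below the threshold: left unchanged
large_grad = [0.6, 0.8]   # norm 1.0, above the threshold: rescaled to 0.05

clipped_small = agc_clip_unit(small_grad, weight)
clipped_large = agc_clip_unit(large_grad, weight)
```

In this sketch, a larger clip_factor permits larger gradients relative to the weights, while eps only matters for weights whose norm is close to zero.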