Adadelta
- class Adadelta(params, lr=1.0, rho=0.9, eps=1e-06, weight_decay=0.0)
Implements the Adadelta algorithm proposed in “ADADELTA: An Adaptive Learning Rate Method”.
- Parameters
params (Union[Iterable[Parameter], dict]) – iterable of parameters to optimize or dicts defining parameter groups.
lr (float) – coefficient that scales delta before it is applied to the parameters. Default: 1.0.
rho (float) – coefficient used for computing a running average of squared gradients. Default: 0.9.
eps (float) – term added to the denominator to improve numerical stability. Default: 1e-6.
weight_decay (float) – weight decay (L2 penalty). Default: 0.0.
- Returns
An instance of the Adadelta optimizer.
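
To show how lr, rho, eps and weight_decay interact, here is a minimal pure-Python sketch of one Adadelta update for a single scalar parameter, following the update rule in the ADADELTA paper. It is not this class's implementation. The function `adadelta_step`, the `state` dictionary and its keys are hypothetical names introduced for illustration, and folding weight decay into the gradient as an L2 term is an assumption about how the penalty is applied.

```python
import math


def adadelta_step(x, grad, state, lr=1.0, rho=0.9, eps=1e-6, weight_decay=0.0):
    """Apply one Adadelta update to a scalar parameter x (illustrative sketch)."""
    # Assumption: the L2 penalty is added to the gradient before the update.
    if weight_decay != 0.0:
        grad = grad + weight_decay * x

    # Running average of squared gradients, controlled by rho.
    state["sq_avg"] = rho * state["sq_avg"] + (1.0 - rho) * grad * grad

    # Delta: RMS of past deltas over RMS of gradients, times the gradient.
    # eps keeps both square roots away from zero.
    delta = (
        math.sqrt(state["acc_delta"] + eps)
        / math.sqrt(state["sq_avg"] + eps)
        * grad
    )

    # Running average of squared deltas, also controlled by rho.
    state["acc_delta"] = rho * state["acc_delta"] + (1.0 - rho) * delta * delta

    # lr scales delta before it is applied to the parameter.
    return x - lr * delta


def minimize_quadratic(steps=10, x=3.0):
    """Run a few updates on f(x) = x**2, whose gradient is 2 * x."""
    state = {"sq_avg": 0.0, "acc_delta": 0.0}
    for _ in range(steps):
        x = adadelta_step(x, 2.0 * x, state)
    return x
```

With the default eps the first steps are very small, because the running average of squared deltas starts at zero and the numerator is initially just sqrt(eps). A larger eps makes the early steps larger.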