module

optim

29 members
lucid.optim

Classes

classSGD2 methods

Stochastic Gradient Descent optimizer with optional momentum and weight decay.

classAdam2 methods

Adaptive Moment Estimation optimizer (Kingma & Ba, 2015).

classAdamW2 methods

Adam optimizer with decoupled weight decay regularisation.

classLBFGS3 methods

Limited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) optimizer.

classRMSprop2 methods

Root Mean Square Propagation optimizer.

classAdagrad2 methods

Adaptive Gradient optimizer.

classAdadelta2 methods

Adadelta optimizer — an adaptive learning rate method with no global LR.

classAdamax2 methods

Adamax optimizer — a variant of Adam based on the infinity norm.

classRAdam2 methods

Rectified Adam optimizer with variance-adapted step size.

classNAdam2 methods

Nesterov-accelerated Adaptive Moment Estimation optimizer.

classASGD2 methods

Averaged Stochastic Gradient Descent optimizer.

classRprop2 methods

Resilient Backpropagation optimizer.

classSparseAdam2 methods

Adam optimizer designed for sparse gradient workloads.

classStepLR2 methods

Decay the learning rate by a fixed multiplicative factor every fixed number of epochs.

classExponentialLR2 methods

Decay the learning rate by a constant factor every epoch.

classMultiStepLR2 methods

Decay the learning rate by a fixed factor at a list of epoch milestones.

classCosineAnnealingLR2 methods

Anneal the learning rate following a cosine curve over `T_max` epochs.

classLambdaLR2 methods

Set the learning rate using a user-supplied multiplicative factor function.

classCyclicLR2 methods

Cycle the learning rate between `base_lr` and `max_lr`.

classReduceLROnPlateau2 methods

Reduce the learning rate when a monitored metric stops improving.

classNoamScheduler2 methods

Noam learning rate schedule from the original Transformer paper.

classMultiplicativeLR2 methods

Multiply the learning rate by a factor returned by a function each epoch.

classLinearLR2 methods

Linearly interpolate the learning rate from a start factor to an end factor.

classConstantLR2 methods

Hold the learning rate at a constant scaled value, then restore the base LR.

classPolynomialLR2 methods

Decay the learning rate using a polynomial function over a fixed number of steps.

classCosineAnnealingWarmRestarts3 methods

Cosine annealing with periodic warm restarts (SGDR).

classOneCycleLR2 methods

1Cycle learning rate policy.

classSequentialLR3 methods

Chain multiple schedulers and switch between them at epoch milestones.

classChainedScheduler3 methods

Apply multiple schedulers simultaneously, compounding their LR changes.