optim
29 memberslucid.optimClasses
SGD2 methodsStochastic Gradient Descent optimizer with optional momentum and weight decay.
Adam2 methodsAdaptive Moment Estimation optimizer (Kingma & Ba, 2015).
AdamW2 methodsAdam optimizer with decoupled weight decay regularisation.
LBFGS3 methodsLimited-memory Broyden–Fletcher–Goldfarb–Shanno (L-BFGS) optimizer.
RMSprop2 methodsRoot Mean Square Propagation optimizer.
Adagrad2 methodsAdaptive Gradient optimizer.
Adadelta2 methodsAdadelta optimizer — an adaptive learning rate method with no global LR.
Adamax2 methodsAdamax optimizer — a variant of Adam based on the infinity norm.
RAdam2 methodsRectified Adam optimizer with variance-adapted step size.
NAdam2 methodsNesterov-accelerated Adaptive Moment Estimation optimizer.
ASGD2 methodsAveraged Stochastic Gradient Descent optimizer.
Rprop2 methodsResilient Backpropagation optimizer.
SparseAdam2 methodsAdam optimizer designed for sparse gradient workloads.
StepLR2 methodsDecay the learning rate by a fixed multiplicative factor every fixed number of epochs.
ExponentialLR2 methodsDecay the learning rate by a constant factor every epoch.
MultiStepLR2 methodsDecay the learning rate by a fixed factor at a list of epoch milestones.
CosineAnnealingLR2 methodsAnneal the learning rate following a cosine curve over `T_max` epochs.
LambdaLR2 methodsSet the learning rate using a user-supplied multiplicative factor function.
CyclicLR2 methodsCycle the learning rate between `base_lr` and `max_lr`.
ReduceLROnPlateau2 methodsReduce the learning rate when a monitored metric stops improving.
NoamScheduler2 methodsNoam learning rate schedule from the original Transformer paper.
MultiplicativeLR2 methodsMultiply the learning rate by a factor returned by a function each epoch.
LinearLR2 methodsLinearly interpolate the learning rate from a start factor to an end factor.
ConstantLR2 methodsHold the learning rate at a constant scaled value, then restore the base LR.
PolynomialLR2 methodsDecay the learning rate using a polynomial function over a fixed number of steps.
CosineAnnealingWarmRestarts3 methodsCosine annealing with periodic warm restarts (SGDR).
OneCycleLR2 methods1Cycle learning rate policy.
SequentialLR3 methodsChain multiple schedulers and switch between them at epoch milestones.
ChainedScheduler3 methodsApply multiple schedulers simultaneously, compounding their LR changes.