emmi.optim.lion¶

Classes¶

Implements Lion algorithm.

lion(params, grads, exp_avgs[, maximize, foreach])

Functional API that performs Lion algorithm computation.

class emmi.optim.lion.Lion(params, lr, betas=(0.9, 0.99), weight_decay=0.0, caution=False, maximize=False, foreach=None)¶

Bases: torch.optim.optimizer.Optimizer

Implements Lion algorithm.

Initialize the hyperparameters.

Parameters:

params (torch.optim.optimizer.ParamsT) – iterable of parameters to optimize or dicts defining parameter groups
lr (float) – learning rate
betas (tuple[float, float]) – coefficients used for computing running averages of gradient and its square
weight_decay (float) – weight decay coefficient
caution (bool) – apply caution
maximize (bool)
foreach (bool | None)

step(closure=None)¶

Performs a single optimization step.

Parameters:: closure – A closure that reevaluates the model and returns the loss.
Returns:: the loss.

emmi.optim.lion.lion(params, grads, exp_avgs, maximize=False, foreach=None, *, beta1, beta2, lr, weight_decay, caution)¶

Functional API that performs Lion algorithm computation.

Parameters: