This is a package for non-smooth optimization algorithms based on proximal methods.
Provides proximal operator evaluation routines and proximal optimization algorithms, such as (accelerated) proximal gradient methods and alternating direction method of multipliers (ADMM), for non-smooth/non-differentiable objective functions.
The following proximal operators are supported
-
soft_thresh(x, λ)
: proximal operator of$\ell_{1}$ -norm -
block_soft_thresh(x, λ)
: proximal operator of$\ell_{2}$ -norm -
shrinkage(x, λ)
: proximal operator of$\ell^{2}_{2}$ -norm (ridge) -
shrinkage(x, λ, A, b)
: proximal operator of quadratic function$f(x) = c + b^{\prime}x + x^{\prime}Ax$ -
smooth(x, λ, f, ∇f!, y_prev)
: proximal operator of a general smooth objective function$f(x)$
Proximal gradient methods, w/ and w/o acceleration, can be used to optimize an objective function that can be split into two components, one of which is differentiable, i.e.
where
Alternating direction method of multipliers (ADMM), also known as Douglas-Rachford splitting, can be used to optimize an objective function that can be split into two components, where both components can be non-smooth.
Linearized ADMM can be used to solve problems of the form
where
For the proximal gradient method there exist so-called accelerated versions, which implies the following update step at iteration
Two flavours of this acceleration are implemented
-
Simple extrapolation, i.e.
$\omega^{k} = \frac{k - 1}{k + 2}$ for every iteration$k$ . -
Nesterov momentum extrapolation, i.e.
$\omega^{k} = \frac{\lambda^{k}\theta^{k-1}(1 - \theta^{k-1})}{\lambda^{k-1}\theta^{k} + \lambda^{k}(\theta^{k-1})^{2}}$ where$\theta^{k}$ is the positive root of the quadratic equation$$ \frac{(\theta^{k})^{2}}{\lambda^{k}} = (1 - \theta^{k})\frac{(\theta^{k-1})^{2}}{\lambda^{k-1}} + m\theta^{k} $$
To install, in the Julia REPL:
using Pkg; Pkg.add("ProximalMethods")