Struct LBFGS

Source

pub struct LBFGS<L, P, G, F> { /* private fields */ }

Expand description

§Limited-memory BFGS (L-BFGS) method

L-BFGS is an approximation to BFGS which requires a limited amount of memory. Instead of storing the inverse, only a few vectors which implicitly represent the inverse matrix are stored.

It requires a line search and the number of vectors to be stored (history size m) must be set. Additionally an initial guess for the parameter vector is required, which is to be provided via the configure method of the Executor (See IterState, in particular IterState::param). In the same way the initial gradient and cost function corresponding to the initial parameter vector can be provided. If these are not provided, they will be computed during initialization of the algorithm.

Two tolerances can be configured, which are both needed in the stopping criteria. One is a tolerance on the gradient (set with with_tolerance_grad): If the norm of the gradient is below said tolerance, the algorithm stops. It defaults to sqrt(EPSILON). The other one is a tolerance on the change of the cost function from one iteration to the other. If the change is below this tolerance (default: EPSILON), the algorithm stops. This parameter can be set via with_tolerance_cost.

§Orthant-Wise Limited-memory Quasi-Newton (OWL-QN) method

OWL-QN is a method that adapts L-BFGS to L1-regularization. The original L-BFGS requires a loss function to be differentiable and does not support L1-regularization. Therefore, this library switches to OWL-QN when L1-regularization is specified. L1-regularization can be performed via with_l1_regularization.

TODO: Implement compact representation of BFGS updating (Nocedal/Wright p.230)

§Requirements on the optimization problem

The optimization problem is required to implement CostFunction and Gradient.

§Reference

Jorge Nocedal and Stephen J. Wright (2006). Numerical Optimization. Springer. ISBN 0-387-30303-0.

Galen Andrew and Jianfeng Gao (2007). Scalable Training of L1-Regularized Log-Linear Models, International Conference on Machine Learning.

Struct LBFGSCopy item path

§Limited-memory BFGS (L-BFGS) method

§Orthant-Wise Limited-memory Quasi-Newton (OWL-QN) method

§Requirements on the optimization problem

§Reference

Implementations§

impl<L, P, G, F> LBFGS<L, P, G, F>where F: ArgminFloat,

pub fn new(linesearch: L, m: usize) -> Self

§Example

pub fn with_tolerance_grad(self, tol_grad: F) -> Result<Self, Error>

§Example

pub fn with_tolerance_cost(self, tol_cost: F) -> Result<Self, Error>

§Example

pub fn with_l1_regularization(self, l1_coeff: F) -> Result<Self, Error>

§Example

Trait Implementations§

impl<L: Clone, P: Clone, G: Clone, F: Clone> Clone for LBFGS<L, P, G, F>

fn clone(&self) -> LBFGS<L, P, G, F>

fn clone_from(&mut self, source: &Self)

impl<'de, L, P, G, F> Deserialize<'de> for LBFGS<L, P, G, F>where L: Deserialize<'de>, P: Deserialize<'de>, G: Deserialize<'de>, F: Deserialize<'de>,

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl<L, P, G, F> Serialize for LBFGS<L, P, G, F>where L: Serialize, P: Serialize, G: Serialize, F: Serialize,

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

fn name(&self) -> &str

fn init( &mut self, problem: &mut Problem<O>, state: IterState<P, G, (), (), (), F>, ) -> Result<(IterState<P, G, (), (), (), F>, Option<KV>), Error>

fn next_iter( &mut self, problem: &mut Problem<O>, state: IterState<P, G, (), (), (), F>, ) -> Result<(IterState<P, G, (), (), (), F>, Option<KV>), Error>

fn terminate( &mut self, state: &IterState<P, G, (), (), (), F>, ) -> TerminationStatus

fn terminate_internal(&mut self, state: &I) -> TerminationStatus

Auto Trait Implementations§

impl<L, P, G, F> Freeze for LBFGS<L, P, G, F>where L: Freeze, F: Freeze, G: Freeze,

impl<L, P, G, F> RefUnwindSafe for LBFGS<L, P, G, F>where L: RefUnwindSafe, F: RefUnwindSafe, G: RefUnwindSafe, P: RefUnwindSafe,

impl<L, P, G, F> Send for LBFGS<L, P, G, F>where L: Send, F: Send, G: Send, P: Send,

impl<L, P, G, F> Sync for LBFGS<L, P, G, F>where L: Sync, F: Sync, G: Sync, P: Sync,

impl<L, P, G, F> Unpin for LBFGS<L, P, G, F>where L: Unpin, F: Unpin, G: Unpin, P: Unpin,

impl<L, P, G, F> UnwindSafe for LBFGS<L, P, G, F>where L: UnwindSafe, F: UnwindSafe, G: UnwindSafe, P: UnwindSafe,

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dst: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T> Same for T

type Output = T

impl<SS, SP> SupersetOf<SS> for SPwhere SS: SubsetOf<SP>,

fn to_subset(&self) -> Option<SS>

fn is_in_subset(&self) -> bool

fn to_subset_unchecked(&self) -> SS

fn from_subset(element: &SS) -> SP

impl<T> ToOwned for Twhere T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for Twhere V: MultiLane<T>,

fn vzip(self) -> V

impl<T> DeserializeOwned for Twhere T: for<'de> Deserialize<'de>,

impl<T> SendAlias for T

impl<T> SyncAlias for T

Struct LBFGS

impl<L, P, G, F> LBFGS<L, P, G, F>
where F: ArgminFloat,

impl<'de, L, P, G, F> Deserialize<'de> for LBFGS<L, P, G, F>
where L: Deserialize<'de>, P: Deserialize<'de>, G: Deserialize<'de>, F: Deserialize<'de>,

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

impl<L, P, G, F> Serialize for LBFGS<L, P, G, F>
where L: Serialize, P: Serialize, G: Serialize, F: Serialize,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<L, P, G, F> Freeze for LBFGS<L, P, G, F>
where L: Freeze, F: Freeze, G: Freeze,

impl<L, P, G, F> RefUnwindSafe for LBFGS<L, P, G, F>
where L: RefUnwindSafe, F: RefUnwindSafe, G: RefUnwindSafe, P: RefUnwindSafe,

impl<L, P, G, F> Send for LBFGS<L, P, G, F>
where L: Send, F: Send, G: Send, P: Send,

impl<L, P, G, F> Sync for LBFGS<L, P, G, F>
where L: Sync, F: Sync, G: Sync, P: Sync,

impl<L, P, G, F> Unpin for LBFGS<L, P, G, F>
where L: Unpin, F: Unpin, G: Unpin, P: Unpin,

impl<L, P, G, F> UnwindSafe for LBFGS<L, P, G, F>
where L: UnwindSafe, F: UnwindSafe, G: UnwindSafe, P: UnwindSafe,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T, U> Into<U> for T
where U: From<T>,

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,