Structured SVM

The structured support vector machine is a machine learning algorithm that generalizes the Support Vector Machine (SVM) classifier. Whereas the SVM classifier supports binary classification, multiclass classification and regression, the structured SVM allows training of a classifier for general structured output labels.

As an example, a sample instance might be a natural language sentence, and the output label an annotated parse tree. Training a classifier consists of presenting pairs of a sample and its correct output label. After training, the structured SVM model predicts the corresponding output label for a new sample instance; that is, given a natural language sentence, the classifier can produce the most likely parse tree.

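For a set of $\ell$ training instances $(\boldsymbol{x}_n,y_n) \in \mathcal{X}\times\mathcal{Y}$, $n=1,\dots,\ell$, from a sample space $\mathcal{X}$ and label space $\mathcal{Y}$, the structured SVM minimizes the following regularized risk function.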

$\underset{\boldsymbol{w}}{\min} \quad \|\boldsymbol{w}\|^2 + C \sum_{n=1}^{\ell} \underset{y\in\mathcal{Y}}{\max} \left(\Delta(y_n,y) + \boldsymbol{w}'\Psi(\boldsymbol{x}_n,y) - \boldsymbol{w}'\Psi(\boldsymbol{x}_n,y_n)\right)$

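The function is convex in $\boldsymbol{w}$ because the maximum of a set of affine functions is convex. The function $\Delta: \mathcal{Y} \times \mathcal{Y} \to \mathbb{R}_+$ measures a distance in label space and is an arbitrary function (not necessarily a metric) satisfying $\Delta(y,z) \geq 0$ and $\Delta(y,y)=0$ for all $y,z \in \mathcal{Y}$. The function $\Psi: \mathcal{X} \times \mathcal{Y} \to \mathbb{R}^d$ is a feature function, extracting a feature vector from a given sample and label. The design of this function depends very much on the application.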
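To make the risk function concrete, the following is a minimal sketch that evaluates it on a toy multiclass problem. Everything specific here is an assumption chosen for illustration and not part of the formulation itself: labels are the integers 0 to K-1, `delta` is the 0/1 loss, and `psi` copies the sample into the block of the feature vector belonging to the label. The maximum is found by enumerating all labels, which is only practical when the label space is small.

```python
# Toy setting (assumptions for illustration): integer labels 0..K-1,
# 0/1 label loss, and a block-structured joint feature map.

def delta(y_true, y):
    """0/1 label loss: nonnegative, and zero when the labels agree."""
    return 0.0 if y_true == y else 1.0

def psi(x, y, num_labels):
    """Joint feature map: copy x into the y-th block of a zero vector."""
    d = len(x)
    out = [0.0] * (d * num_labels)
    out[y * d:(y + 1) * d] = x
    return out

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def structured_hinge(w, x, y_true, num_labels):
    """max over y of Delta(y_n, y) + w'Psi(x_n, y) - w'Psi(x_n, y_n)."""
    score_true = dot(w, psi(x, y_true, num_labels))
    return max(
        delta(y_true, y) + dot(w, psi(x, y, num_labels)) - score_true
        for y in range(num_labels)
    )

def regularized_risk(w, samples, C, num_labels):
    """||w||^2 + C * (sum of the per-sample maxima)."""
    return dot(w, w) + C * sum(
        structured_hinge(w, x, y, num_labels) for x, y in samples
    )

# Three labels, two-dimensional samples, so w has 3 * 2 = 6 entries.
w = [1.0, 0.0, 0.0, 1.0, 0.0, 0.0]
samples = [([1.0, 0.0], 0), ([0.0, 1.0], 1), ([1.0, 1.0], 2)]
risk = regularized_risk(w, samples, C=1.0, num_labels=3)
```

Each per-sample term is at least zero, because the choice $y = y_n$ contributes $\Delta(y_n,y_n) = 0$ to the maximum. In this toy example only the third sample incurs a positive term.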

Because the regularized risk function above is non-differentiable, it is often reformulated as a quadratic program by introducing one slack variable $\xi_n$ for each sample, representing the value of the maximum. The standard structured SVM primal formulation is given as follows.

$\begin{array}{cl} \underset{\boldsymbol{w},\boldsymbol{\xi}}{\min} & \|\boldsymbol{w}\|^2 + C \sum_{n=1}^{\ell} \xi_n\\ \textrm{s.t.} & \boldsymbol{w}' \Psi(\boldsymbol{x}_n,y_n) - \boldsymbol{w}' \Psi(\boldsymbol{x}_n,y) + \xi_n \geq \Delta(y_n,y),\qquad n=1,\dots,\ell,\quad \forall y \in \mathcal{Y} \end{array}$
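The link between the slack variables and the maximum in the risk function can be checked numerically. The sketch below is a hedged illustration for a single sample: the list `scores` holds hypothetical values of $\boldsymbol{w}'\Psi(\boldsymbol{x}_n,y)$ for each candidate label, and `zero_one` stands in for $\Delta$. Both are assumptions made for the example. The smallest slack that satisfies every constraint for that sample equals the value of the maximum.

```python
def zero_one(y_true, y):
    """Stand-in for Delta: 0/1 label loss (an assumption for this example)."""
    return 0.0 if y_true == y else 1.0

def min_slack(scores, y_true, delta):
    """Smallest xi with scores[y_true] - scores[y] + xi >= delta(y_true, y) for all y."""
    return max(
        delta(y_true, y) + scores[y] - scores[y_true]
        for y in range(len(scores))
    )

def is_feasible(scores, y_true, xi, delta, tol=1e-12):
    """Check all constraints of one sample for a given slack value xi."""
    return all(
        scores[y_true] - scores[y] + xi >= delta(y_true, y) - tol
        for y in range(len(scores))
    )

scores = [0.5, 2.0, 1.0]  # hypothetical w'Psi(x_n, y) for y = 0, 1, 2
y_true = 2

xi = min_slack(scores, y_true, zero_one)
feasible_at_xi = is_feasible(scores, y_true, xi, zero_one)
feasible_below_xi = is_feasible(scores, y_true, xi - 0.1, zero_one)
```

Here the constraint for label 1 is the binding one: it scores higher than the true label, so the slack must absorb both the score gap and the label loss. Any smaller slack violates that constraint, which is why minimizing over $\xi_n$ recovers the maximum.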
