Support Vector Machine
Short Definition
Full Definition
Support Vector Machines are one of the most elegant and theoretically well-founded machine learning algorithms, developed by Vladimir Vapnik and colleagues in the 1990s. The core idea is to find the decision boundary (hyperplane) that best separates two classes while maximizing the margin — the distance between the boundary and the closest data points from each class, called support vectors. This maximum margin principle gives SVMs strong generalization properties. For linearly separable data, the algorithm finds the unique optimal hyperplane. For non-linearly separable data, SVMs use the kernel trick to implicitly map data into a higher-dimensional space where linear separation becomes possible, without actually computing the transformation. Common kernels include linear, polynomial, and radial basis function (RBF). SVMs were the dominant algorithm in machine learning before the deep learning revolution, achieving state-of-the-art results on text classification, image recognition, and bioinformatics. They remain relevant for small to medium datasets, high-dimensional problems like text classification, and applications where theoretical guarantees are valued. SVMs can also handle regression tasks (SVR) and one-class classification for anomaly detection.
Technical Explanation
The SVM optimization problem: minimize (1/2)||w||^2 subject to y_i(w·x_i + b) >= 1 for all i. The dual formulation introduces Lagrange multipliers alpha_i: maximize sum(alpha_i) – (1/2)sum(alpha_i*alpha_j*y_i*y_j*K(x_i,x_j)) subject to alpha_i >= 0 and sum(alpha_i*y_i) = 0. The kernel trick: K(x_i, x_j) = phi(x_i)·phi(x_j) computes inner products in high-dimensional space without explicit mapping. Common kernels: linear K=x·x’, polynomial K=(x·x’+c)^d, RBF K=exp(-gamma||x-x’||^2). Soft margin SVM handles non-separable data with slack variables and penalty parameter C controlling the trade-off between margin maximization and misclassification tolerance.
Use Cases
Advantages
Disadvantages
Schema Type
Featured Snippet Candidate
Difficulty Level