What is Kernel SVM?
It is a concept that was developed to compensate for the disadvantage that SVM cannot handle non-linear datasets like image below.
data:image/s3,"s3://crabby-images/2e941/2e94173f2c50b429f8eb5afb0fa7b803ba1b7820" alt=""
So, how does Kernel SVM classify non-linear datasets? The answer is by mapping them to a higher dimension like image below.
data:image/s3,"s3://crabby-images/d7e9a/d7e9ae71f28744f3cb7a88cf8e472b59243c01da" alt=""
By mapping to a higher dimension, we can obtain a linearly separable dataset divided by a hyperplane in that space. After this process, by projecting back to the lower dimension, you can get a result with a round hyperplane, as shown in the image below.
data:image/s3,"s3://crabby-images/abca3/abca3262a5f28bb78ef9ee64cd3a7d927e5468dc" alt=""
Can Kernel SVM then classify a more complex dataset, as shown in the image below?”
data:image/s3,"s3://crabby-images/af346/af346680163eea23b1caf4b93d0e55a4cb73a25d" alt=""
The answer is YES. By adding more than two kernel formulas such as the Gaussian RBF kernel, it can classify the dataset.
Here is a simple Gaussian RBF kernel formula:
So, the result of the kernel SVM will appear like this:
data:image/s3,"s3://crabby-images/8f01c/8f01ca87997108943ab03259c113bdc81e99b0e7" alt=""
- Green when:
$K($ $\overrightarrow{x}$ $, \overrightarrow{l}^{1} )$ $+ K($ $\overrightarrow{x}$ $, \overrightarrow{l}^{2} ) > 0$ - Red when:
$K($ $\overrightarrow{x}$ $, \overrightarrow{l}^{1} )$ $+ K($ $\overrightarrow{x}$ $, \overrightarrow{l}^{2} ) = 0$
Types of Kernel Functions
- Gaussian RBF Kernel
data:image/s3,"s3://crabby-images/5eea0/5eea0eee88dcd2d0e4f7d14ae7b6b1cc0b1d840f" alt=""
- Sigmoid Kernel
data:image/s3,"s3://crabby-images/cc6a3/cc6a3126cee785e4e2c3485dd7eeb40746e78778" alt=""
- Polynomial Kernel
data:image/s3,"s3://crabby-images/e8c42/e8c4286c941f66085a309ae2d851ba169d708ed5" alt=""
Example
Code
1
2
3
4
5
6
7
8
9
10
11
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)
classifier = SVC(kernel='rbf', random_state=0)
classifier.fit(X_train, y_train)
y_pred = classifier.predict(X_test)
Result
data:image/s3,"s3://crabby-images/18f9a/18f9a8366abbf543be40bf90e92e6fe9a7d4097f" alt=""