These pages are not displaying properly because the Compatibility View in your Internet Explorer is enabled. We suggest that you remove 'fu-berlin.de' from your list of sites that have Compatibility View enabled.

In Internet Explorer, press the 'Alt' key to display the Menu bar, or press and hold the address bar and select 'Menu bar'.

Click 'Tools' and select 'Compatibility View settings'.

Select 'fu-berlin.de' under 'Websites you've added to Compatibility View'.

Click 'Remove'.

Support Vector Machine (Statistics)

The master bioinformatics offers the chance to get acquainted to several machine learning techniques. Machine learning algorithms are used in various courses, the theoretical ground work is taught in statistics. There we repeat how linear regression and hidden markov machines work, learning more about the mathematical concept of markov chains, and we get introduced to support vector machines (SVMs) and how to make them more efficient using the kernel trick.

SVMs are supervised learning models usually used on very large data sets which you want to divide into two classes. The division is performed by laying a plane or hyperplane between the two classes and maximizing the distance to each of them. If the classes are not linearly separable you have to transform the input space with a transformation function, called phi here, which introduces more dimensions. This function phi has to be cleverly chosen such that the two classes get cleanly separated without introducing too many new dimensions. Of course if you add enough dimensions everything can become separable, even if in reality there are no features distinguishing them. Another problem with adding too many dimensions is overfitting (learning properties of single data points instead of general class properties). Even when it does make sense to introduce this many new dimensions you will end up running into the curse of dimensionality. For with every new dimension the input points end up further apart from each other until the data is so sparse that the positioning of the dividing hyper plane becomes in a way arbitrary, which of course compromises your results when trying to classify new points.

Match the functions used for transforming the input space to the resulting decision borders. x=(x₁,x₂)

Φ(x): ℝ² → ℝ²

We didn't change the number of dimensions, the space is still two-dimensional and thus the decision boundary linear.

Φ(x): ℝ² → ℝ⁵

Enough dimensions were added to correctly divide the two classes, but not overfit.

Φ(x): ℝ² → ℝ¹⁰²

So many dimensions were added that the decision boundary was fitted to single points of one class.

You get feedback for each answer by clicking on the button.

M.Sc. Bioinformatics

Support Vector Machine (Statistics)

Match the functions used for transforming the input space to the resulting decision borders. x=(x₁,x₂)

Mark if the following statements are true or not.

Support Vector Machine (Statistics)

Match the functions used for transforming the input space to the resulting decision borders. x=(x1,x2)

Mark if the following statements are true or not.

Match the functions used for transforming the input space to the resulting decision borders. x=(x₁,x₂)