Description
Problems
Theory
1. Suppsoe Ξ : π³ Γπ³ β R is a PDS kernel. Prove that βπ₯, π₯β² β π³ it holds that |Ξ(π₯, π₯β²
)|
2 β€ Ξ(π₯, π₯)Ξ(π₯
β²
, π₯β²
).
2. Given a kernel πΎ on π³ define its normalized version as
πΎΒ― (π₯, π₯β²
) =
β§
βͺβ¨
βͺβ©
0 if πΎ(π₯, π₯) = 0 or πΎ(π₯
β²
, π₯β²
) = 0
πΎ(π₯, π₯β²
)
βοΈ
πΎ(π₯, π₯)
βοΈ
πΎ(π₯
β²
, π₯β²)
Otherwise.
Show that if πΎ is PDS then so is πΎΒ― .
3. Show that the following kernels on R
π are PDS:
β’ Polynomial kernel: πΎ(π₯, π₯β²
) = (οΈ
π₯
π π₯
β² + π
)οΈπΌ
for π > 0 and πΌ β N.
β’ Exponential kernel: πΎ(π₯, π₯β²
) = exp(π₯
π π₯
β²
).
β’ RBF kernel: πΎ(π₯, π₯β²
) = exp(βπΎ
2βπ₯ β π₯
β²β
2
2
).
4. Let Ξ© β R
π and let {ππ}
π
π=1 be a sequence of continuous functions on Ξ© and {ππ}
π
π=1 a sequence of
non-negative numbers. Show that πΎ(π₯, π₯β²
) = βοΈπ
π=1 ππππ (π₯)ππ (π₯
β²
) is a PDS kernel on Ξ©.
5. Show that: (i) if πΎ and πΎβ² are two reproducing kernels for an RKHS β, then they have to be the
same. (ii) the RKHS of a PDS kernel πΎ is unique.
Computation
Download the MNIST training and test .csv files from Canvas and load them on your computer. I suggest
you use Python or MATLAB for this excercise.
β’ Use Principle Component Analysis (PCA) on the training set to reduce the dimension of your input.
How many modes do you need to preserve 95% of the variance in the training set?
β’ Extract the digits 1 and 9 from the training set. Use kernel regression to design and train a classifier
to distinguish these digits using three different kernels of your choosing (I suggest RBF, Polynomila,
and linear). It is a good idea to use PCA to reduce your input dimensions here.
Also, you may use
cross validation to tune your kernel/regularization/nugget parameters if you need them. Present the
training and test error of your method.
β’ Repeat the above experiment for the digits (3, 8), (1, 7), and (5, 2).
β’ Write a report of a maximum of four pages, outlining your results and findings.