Multidimensional Scaling (MDS)-白红宇

Multidimensional Scaling (MDS)


Published: 2019-05-24


MDS aims to embed data in a lower-dimensional space in such a way that pairwise distances between data points are preserved.

Say we have N points $x_i \in R^n$ for $i \in [1, N]$, and let $X = [x_1, x_2, \cdots, x_N]$. We don't know the positions of the $x_i$; we are only supplied with the pairwise Euclidean distances among these points. The objective is to find N points $y_i \in R^k, k < n$, with $Y = [y_1, y_2, \cdots, y_N]$, such that the pairwise distances among the points of $Y$ are the same as those among the points of $X$.

Given the distance matrix $D^X$, the square of each element of $D^X$ can be written as:

$(D^X_{ij})^2 = (x_i-x_j)^T(x_i-x_j)=\lVert x_i \rVert^2-2x^T_ix_j+\lVert x_j \rVert^2$

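Collecting these squared distances into a matrix, which with a slight abuse of notation we continue to call $D^X$ (so from here on $D^X$ holds the squared distances), we can see that

$D^X = Z-2X^TX+Z^T$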

Here, $Z = ze^T$, where $e$ is the $N$-dimensional column vector of all ones and $z = [\lVert x_1 \rVert^2, \lVert x_2 \rVert^2, \cdots, \lVert x_N \rVert^2]^T$. Therefore $Z$ takes the form

$Z = \begin{bmatrix}\lVert x_1 \rVert^2 &\lVert x_1 \rVert^2&\cdots&\lVert x_1 \rVert^2\\\lVert x_2 \rVert^2 &\lVert x_2 \rVert^2&\cdots&\lVert x_2 \rVert^2\\\vdots&\vdots& \ddots &\vdots\\\lVert x_N \rVert^2 &\lVert x_N \rVert^2&\cdots&\lVert x_N \rVert^2\end{bmatrix}$
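The decomposition of the squared distances into $Z - 2X^TX + Z^T$ can be checked numerically. The following is a minimal NumPy sketch, not part of the original derivation; the sizes `n` and `N`, the random points, and all variable names are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n, N = 3, 5                       # hypothetical sizes: dimension n, number of points N
X = rng.standard_normal((n, N))   # columns of X are the points x_i

# Squared pairwise distances computed directly from the definition.
diff = X[:, :, None] - X[:, None, :]   # diff[:, i, j] = x_i - x_j
D2_direct = np.sum(diff ** 2, axis=0)

# The same matrix built as Z - 2 X^T X + Z^T with Z = z e^T.
z = np.sum(X ** 2, axis=0)             # z_i = ||x_i||^2
e = np.ones(N)
Z = np.outer(z, e)                     # row i is constant and equal to ||x_i||^2
D2_formula = Z - 2 * X.T @ X + Z.T

identity_holds = np.allclose(D2_direct, D2_formula)
print(identity_holds)
```

Note that the identity holds for the matrix of squared distances, which is why the distances are squared before any further processing.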

Now, let's translate the hypothetical point set $X$ so that its mean lies at the origin. Note that this operation does not change the Euclidean distance between any pair of points.

To see how centering works, consider the products $\frac{1}{N}Aee^T$ and $\frac{1}{N}ee^TA$. Here, $A$ is an N-by-N matrix of the form:

$A = \begin{bmatrix}A_{11} &A_{12} &\cdots&A_{1N}\\A_{21} &A_{22} &\cdots&A_{2N}\\\vdots&\vdots& \ddots &\vdots\\A_{N1} &A_{N2} &\cdots&A_{NN}\end{bmatrix}$

Hence,

$\begin{aligned}\frac{1}{N}Aee^T&=\frac{1}{N}\begin{bmatrix}A_{11} &A_{12} &\cdots&A_{1N}\\A_{21} &A_{22} &\cdots&A_{2N}\\\vdots&\vdots& \ddots &\vdots\\A_{N1} &A_{N2} &\cdots&A_{NN}\end{bmatrix}\begin{bmatrix}1 &1 &\cdots&1\\1 &1 &\cdots&1\\\vdots&\vdots& \ddots &\vdots\\1 &1 &\cdots&1\end{bmatrix}=\begin{bmatrix}\frac{1}{N}\sum_{j=1}^N A_{1j} &\frac{1}{N}\sum_{j=1}^N A_{1j} &\cdots&\frac{1}{N}\sum_{j=1}^N A_{1j}\\\frac{1}{N}\sum_{j=1}^N A_{2j} &\frac{1}{N}\sum_{j=1}^N A_{2j} &\cdots&\frac{1}{N}\sum_{j=1}^N A_{2j}\\\vdots&\vdots& \ddots &\vdots\\\frac{1}{N}\sum_{j=1}^N A_{Nj} &\frac{1}{N}\sum_{j=1}^N A_{Nj} &\cdots&\frac{1}{N}\sum_{j=1}^N A_{Nj}\end{bmatrix}\\&= \begin{bmatrix}\text{mean of first row of A} &\text{mean of first row of A} &\cdots&\text{mean of first row of A}\\\text{mean of second row of A} &\text{mean of second row of A} &\cdots&\text{mean of second row of A}\\\vdots&\vdots& \ddots &\vdots\\\text{mean of Nth row of A} &\text{mean of Nth row of A} &\cdots&\text{mean of Nth row of A}\end{bmatrix}\end{aligned}$

Similarly,

$\begin{aligned}\frac{1}{N}ee^TA&=\frac{1}{N}\begin{bmatrix}1 &1 &\cdots&1\\1 &1 &\cdots&1\\\vdots&\vdots& \ddots &\vdots\\1 &1 &\cdots&1\end{bmatrix}\begin{bmatrix}A_{11} &A_{12} &\cdots&A_{1N}\\A_{21} &A_{22} &\cdots&A_{2N}\\\vdots&\vdots& \ddots &\vdots\\A_{N1} &A_{N2} &\cdots&A_{NN}\end{bmatrix}=\begin{bmatrix}\frac{1}{N}\sum_{i=1}^N A_{i1} &\frac{1}{N}\sum_{i=1}^N A_{i2} &\cdots&\frac{1}{N}\sum_{i=1}^N A_{iN}\\\frac{1}{N}\sum_{i=1}^N A_{i1} &\frac{1}{N}\sum_{i=1}^N A_{i2} &\cdots&\frac{1}{N}\sum_{i=1}^N A_{iN}\\\vdots&\vdots& \ddots &\vdots\\\frac{1}{N}\sum_{i=1}^N A_{i1} &\frac{1}{N}\sum_{i=1}^N A_{i2}&\cdots&\frac{1}{N}\sum_{i=1}^N A_{iN}\end{bmatrix}\\&= \begin{bmatrix}\text{mean of first column of A} &\text{mean of second column of A} &\cdots&\text{mean of Nth column of A}\\\text{mean of first column of A} &\text{mean of second column of A} &\cdots&\text{mean of Nth column of A}\\\vdots&\vdots& \ddots &\vdots\\\text{mean of first column of A} &\text{mean of second column of A} &\cdots&\text{mean of Nth column of A}\end{bmatrix}\end{aligned}$
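The two products can be tried out on a small matrix. This is an illustrative NumPy sketch under the assumption of an arbitrary 4-by-4 example matrix; the names `row_means` and `col_means` are hypothetical.

```python
import numpy as np

N = 4
A = np.arange(N * N, dtype=float).reshape(N, N)   # arbitrary example matrix
e = np.ones((N, 1))                               # column vector of ones

row_means = A @ e @ e.T / N   # entry (i, j) is the mean of row i of A
col_means = e @ e.T @ A / N   # entry (i, j) is the mean of column j of A

# Compare against NumPy's own row and column means, repeated across the matrix.
rows_ok = np.allclose(row_means, np.tile(A.mean(axis=1, keepdims=True), (1, N)))
cols_ok = np.allclose(col_means, np.tile(A.mean(axis=0, keepdims=True), (N, 1)))
print(rows_ok, cols_ok)
```

Multiplying by $ee^T$ on the right averages along each row, while multiplying on the left averages along each column.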

The centering matrix is defined as:

$H = I_N - \frac{1}{N}ee^T$
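A few properties of the centering matrix are worth checking by hand or numerically: it is symmetric, it is idempotent, it maps the all-ones vector to zero, and right-multiplying a point matrix by it subtracts the mean point from every column. The sketch below assumes a small random point set; sizes and names are illustrative only.

```python
import numpy as np

N = 5
e = np.ones((N, 1))
H = np.eye(N) - e @ e.T / N          # centering matrix H = I_N - (1/N) e e^T

symmetric = np.allclose(H, H.T)      # H^T = H
idempotent = np.allclose(H @ H, H)   # H H = H
kills_ones = np.allclose(H @ e, 0)   # H e = 0

# Right-multiplying by H translates the mean of the points to the origin.
rng = np.random.default_rng(1)
X = rng.standard_normal((2, N))      # hypothetical points stored as columns
X_centered = X @ H
centered_ok = np.allclose(X_centered, X - X.mean(axis=1, keepdims=True))
print(symmetric, idempotent, kills_ones, centered_ok)
```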

Let's now apply double centering to $D^X$ to get

$\begin{aligned}A^X &= HD^XH=( I_N - \frac{1}{N}ee^T)(Z-2X^TX+Z^T)( I_N - \frac{1}{N}ee^T)\\&=( I_N - \frac{1}{N}ee^T)Z( I_N - \frac{1}{N}ee^T)-2( I_N - \frac{1}{N}ee^T)X^TX( I_N - \frac{1}{N}ee^T)+( I_N - \frac{1}{N}ee^T)Z^T( I_N - \frac{1}{N}ee^T)\\&=-2( I_N - \frac{1}{N}ee^T)X^TX( I_N - \frac{1}{N}ee^T)=-2(X( I_N - \frac{1}{N}ee^T))^TX( I_N - \frac{1}{N}ee^T)=-2\tilde X^T\tilde X\end{aligned}$

where $\tilde X = X( I_N - \frac{1}{N}ee^T)$ is the point set with its mean translated to the origin. The two terms involving $Z$ vanish because $e^TH = e^T - \frac{1}{N}(e^Te)e^T = 0$ (using $e^Te = N$) and likewise $He = 0$, so $ZH = ze^TH = 0$ and $HZ^T = Hez^T = 0$.

We then define the Gram matrix of the centered points:

$B^X = -\frac{1}{2}A^X= -\frac{1}{2}HD^XH=\tilde X^T\tilde X$
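The double-centering result can be verified end to end: starting only from the squared distances, $-\frac{1}{2}HD^XH$ should reproduce $\tilde X^T\tilde X$. The following NumPy sketch assumes randomly generated points and treats `D2` as the matrix of squared distances; every name in it is an illustrative assumption.

```python
import numpy as np

rng = np.random.default_rng(2)
n, N = 3, 6                              # hypothetical dimension and number of points
X = rng.standard_normal((n, N))          # columns are the (normally unknown) points

# Squared distance matrix D2 = Z - 2 X^T X + Z^T.
G = X.T @ X
z = np.diag(G)
D2 = z[:, None] - 2 * G + z[None, :]

# Double centering: B = -1/2 * H * D2 * H.
H = np.eye(N) - np.ones((N, N)) / N
B = -0.5 * H @ D2 @ H

# B should equal the Gram matrix of the centered points.
X_tilde = X @ H
gram_recovered = np.allclose(B, X_tilde.T @ X_tilde)

# As a Gram matrix, B is positive semidefinite with rank at most n.
psd = np.linalg.eigvalsh(B).min() > -1e-10
rank_ok = np.linalg.matrix_rank(B, tol=1e-8) <= n
print(gram_recovered, psd, rank_ok)
```

The point of the check is that `B` is computed from `D2` alone, without using the coordinates in `X`; the coordinates appear only to confirm the result.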

Remember, the task was to find a concrete set of N points $Y$ in k dimensions so that the pairwise Euclidean distances between all the pairs in the concrete set