We can visualize the data set that each data point has a vector, by projecting each data point on a one-, two- or three-dimensional space.
Each “view” corresponds to a particular projection on a direction
In the 1D case, the projection on a given line passing through x0 and direction a is related to a scoring function, of the form f:x→aTx+b where x∈Rn is a data point, and a∈Rn,b∈R are given.
WLOG we can set ∣∣a∣∣2=1
Centering:
We can center the scores so that the average score across data points is zero, that is:
0=m1i=1∑mf(xi)=m1i=1∑m(aTxi+b)=aTx^+b
This implies a condition on b:b=−aTxˉ, where x^:=n1i=1∑nxi∈Rn is the center of the points for that a projection
Then we can adjust the scoring function to be f(x)=aT(x−x^)
Equivalently we project the centered data points on a line passing through 0