202412121103
Status: #idea
Tags: Unsupervised Learning, Linear Algebra, Singular Value Decomposition
State: #nascient

Principal Components Analysis (PCA)

Data exists in a pdimensional space.
But not all dimensions are made equal.

PCA seeks to find the dimensions across which variability is biggest.
The first component represents the highest variability (and the size of it represents how much variability), and the loading vector represents the direction of maximum variability.

The principal components are sorted in decreasing order such that that the last principal component p is the one where variation is the smallest.

The principal components are orthogonal

Pros

Cons