Principle Component Analysis is a technique used to reduce the number of dimensions of your data. In the process you lose some accuracy but increase the simplicity of your data which can make it easier to use.

Basic Logic

Vocabulary

Eigenvectors: The vector that determines the direction and orientation of a Principle Component
Loading Values: The proportion each variable contributes to the direction of an Eigenvalue is the Loading Score
Eigenvalues: The sum of squares for an Eigenvector

Steps

Scale your numerical variables
Center all of your points around the origin
Find the line of best fit that goes through the origin but also maximizes the distance from your points projected points on the line, and the origin.
This is your first Principle Component and the slope of this line will tell you how important each variable is
- Create a vector along this line of length 1, this is the Eigenvector
- The sum of squares of the Eigenvector is the Eigenvalue
- The amounts each variable contribute to the slope of the Eigenvalue are the Loading Scores
Repeat this process creating as many Principle Components as there are variables which each Principle Component being perpendicular to the existing ones

Implementation

Implementation in Python
Implementation in R
Background Math