A short overview of implementing principal component analysis from first principles in R.

Photo by Edward Howell on Unsplash

Principal component analysis (PCA) is the orthogonal linear transformation of the set of original variables. PCA is commonly used for dimensionality reduction since one can transform a dataset with p random variables to a dataset with k<p variables that still contains most of the information of the original dataset.

There is a package for everything even for something as simple as PCA. Despite this, it is often helpful to do things from first principles. This assists in understanding the method and also provides a deeper intuition into what is being done. Although there is a package for PCA I will…

Claudio Jardim

Data scientist

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store