202405181414
Status: #idea
Tags: Regression Analysis
Coefficient of Determination
Often denoted
It is computed as follows:
But be careful, Correlation and Coefficient of Determination are not the same in general. In fact, as soon as we get to the multivariate case they are no longer equal since correlation measures the linear relation between two variables, and the coefficient of determination measures the explanatory power of an entire model.
Similarly to the Correlation it is bounded between 0 and 1, where 1 indicates a perfect explanatory power and 0 explains basically no power (no better than the mean line itself.)
There is a concept called Adjusted Coefficient of Determination which directly follows from it, and that is important because in a parametric model adding more parameters will NEVER make my model's fit worse. It might leave it the same, but even assuming the new parameter has no value I can always set it to 0 and revert to a less parameterized model. For that reason, we need to dock points from the score of models with more parameters to account for the fact that more parameters can significantly improve a model out of sheer luck.
Warning about Overreliance
You could get a really high