Assumptions for Regression

CEF and the assumptions for regression

Regression Assumptions

Recently was discussing with a friend (Mashrur bhai) about the assumptions for regression, and I thought it would be useful to write down some of the key points. There are essentially two approaches we can introduce the assumptions,

The idea is we start from the idea of CEF, since this is the best prediction function. Here is the setup

Setup: Suppose we have a random variable $Y$ and we would like to predict it using a set of random variables $X = (X_1, X_2, \ldots, X_k)$, where $X$ is a vector of random variables, i.e.,

\[X = \begin{pmatrix} X_1 \\ X_2 \\ \vdots \\ X_k \end{pmatrix}\]

Then the CEF is defined as