CEF and the assumptions for regression
Recently was discussing with a friend (Mashrur bhai) about the assumptions for regression, and I thought it would be useful to write down some of the key points. There are essentially two approaches we can introduce the assumptions,
The idea is we start from the idea of CEF, since this is the best prediction function. Here is the setup
Setup: Suppose we have a random variable $Y$ and we would like to predict it using a set of random variables $X = (X_1, X_2, \ldots, X_k)$, where $X$ is a vector of random variables, i.e.,
\[X = \begin{pmatrix} X_1 \\ X_2 \\ \vdots \\ X_k \end{pmatrix}\]Then the CEF is defined as