{
  "title": "How to Compute the Unbiased Estimator of Error Variance",
  "turns": [
    {
      "turn": 1,
      "user": "How do you find the unbiased estimator of the error variance?",
      "assistant": "Here’s how to compute the unbiased estimator ( $s^{2}$ ) of the error variance ( $\\sigma ^{2}$ ).\n\n### 1\\. Basic concepts\n\n*   **Error variance ( $\\sigma ^{2}$ )**: In a regression model, this is the variance of the errors ( $ϵ_{i}$ ) between the observed values and the values predicted by the regression line. That is,  $Var\\left(ϵ_{i}\\right)=E\\left[\\left(ϵ_{i}-E\\left[ϵ_{i}\\right]\\right)^{2}\\right]=E\\left[ϵ_{i2}\\right]=\\sigma ^{2}$  (assuming  $E\\left[ϵ_{i}\\right]=0$ ).\n*   **Unbiased estimator (Unbiased Estimator)**: An estimator ( $\\theta ^$ ) of a parameter ( $\\theta$ ) is called unbiased if its expectation ( $E\\left[\\theta ^\\right]$ ) equals the parameter, i.e.,  $E\\left[\\theta ^\\right]=\\theta$ .\n*   **Residual (Residual)**: The difference between the actual observed value ( $y_{i}$ ) and the value predicted by the regression model ( $y^​_{i}$ ). That is,  $e_{i}=y_{i}-y^​_{i}$ . A residual is an estimate of the error ( $ϵ_{i}$ ).\n\n### 2\\. Residual Sum of Squares (Residual Sum of Squares, RSS)\n\nTo obtain  $s^{2}$ , first compute the Residual Sum of Squares (RSS). RSS is the sum of squared residuals ( $e_{i}$ ).\n\n$$\nRSS=\\sum_{i=1}^{n} e_{i2}=\\sum_{i=1}^{n} \\left(y_{i}-y^​_{i}\\right)^{2}\n$$\n\nHere,  $n$  is the number of observations.\n\n### 3\\. 
Formula for the unbiased estimator\n\nThe unbiased estimator ( $s^{2}$ ) of the error variance ( $\\sigma ^{2}$ ) is the Residual Sum of Squares (RSS) divided by the residual degrees of freedom.\n\n$$\ns^{2}=\\frac{RSS}{n-p}=\\frac{\\sum_{i=1}^{n} \\left(y_{i}-\\hat{y}_{i}\\right)^{2}}{n-p}\n$$\n\n*   **$n$**: Number of observations.\n*   **$p$**: Number of parameters (regression coefficients, including the intercept) estimated in the model.\n    *   For a simple linear regression model ( $y_{i}=\\beta _{0}+\\beta _{1}x_{i}+\\epsilon _{i}$ ), there are two parameters to estimate,  $\\beta _{0}$  and  $\\beta _{1}$ , so  $p=2$ .\n    *   Therefore, in simple linear regression,  $s^{2}=\\frac{RSS}{n-2}$ .\n    *   In general, for a multiple regression model with  $k$  explanatory variables plus an intercept,  $p=k+1$ .\n\n### 4\\. Why divide by  $n-p$ ? (Proof of unbiasedness)\n\nIf you simply divide the Residual Sum of Squares by  $n$ , the result  $\\frac{RSS}{n}$  is a biased estimator that underestimates  $\\sigma ^{2}$  on average; that is,  $E\\left[\\frac{RSS}{n}\\right]\\ne \\sigma ^{2}$ . Under the standard assumptions (errors that have mean zero, share the constant variance  $\\sigma ^{2}$ , and are uncorrelated with one another, with linearly independent predictors), one can show that\n\n$$\nE\\left[RSS\\right]=E\\left[\\sum_{i=1}^{n} e_{i}^{2}\\right]=\\left(n-p\\right)\\sigma ^{2}\n$$\n\nTherefore,\n\n$$\nE\\left[s^{2}\\right]=E\\left[\\frac{RSS}{n-p}\\right]=\\frac{1}{n-p}E\\left[RSS\\right]=\\frac{1}{n-p}\\left(n-p\\right)\\sigma ^{2}=\\sigma ^{2}\n$$\n\nSince the expectation of  $s^{2}$  equals  $\\sigma ^{2}$ ,  $s^{2}$  is an unbiased estimator of  $\\sigma ^{2}$ . Here,  $n-p$  is called the residual degrees of freedom. It represents the number of residuals that can vary freely out of the  $n$  residuals: estimating  $p$  regression coefficients imposes  $p$  linear constraints on the residuals (the normal equations), reducing the degrees of freedom by  $p$ .\n\n### 5\\. Summary\n\nTo compute the unbiased estimator ( $s^{2}$ ) of the error variance ( $\\sigma ^{2}$ ):\n\n1.  Fit the regression model and compute the fitted values ( $\\hat{y}_{i}$ ).\n2.  Compute the residuals for each observation ( $e_{i}=y_{i}-\\hat{y}_{i}$ ).\n3.  
Compute the Residual Sum of Squares ( $RSS=\\sum_{i=1}^{n} e_{i}^{2}$ ).\n4.  Divide  $RSS$  by the degrees of freedom,  $n-p$ , accounting for the number of parameters ( $p$ ) used in the model.\n    $$\n    s^{2}=\\frac{RSS}{n-p}\n    $$"
    },
    {
      "turn": 2,
      "user": "So it's the average of the residuals, then?",
      "assistant": "That's not correct. The **mean of the residuals is always 0**.\n\nThe reason the residual mean is 0 lies in the regression estimation process. When estimating the regression coefficients using least squares (OLS, Ordinary Least Squares), one of the basic properties is that the sum of the residuals is zero.\n\n$$\n\\sum_{i=1}^{n} e_{i}=\\sum_{i=1}^{n} \\left(y_{i}-y^​_{i}\\right)=0\n$$\n\nTherefore, the residual mean ( $eˉ$ ) is computed as follows and is always 0.\n\n$$\neˉ=\\frac{1}{n}\\sum_{i=1}^{n} e_{i}=\\frac{0}{n}=0\n$$\n\nIt's true that we use the residual sum of squares ( $∑e_{i2}$ ) when estimating the error variance ( $\\sigma ^{2}$ ), but don't mistake this for the mean of the residuals. **The residual sum of squares divided by the degrees of freedom ( $n-p$ )** is the unbiased estimator  $s^{2}$ of the error variance."
    },
    {
      "turn": 3,
      "user": "Oh, sorry — I meant residual sum of squares/residual degrees of freedom.",
      "assistant": "Ah, yes, that's right. The residual sum of squares divided by the residual degrees of freedom is the unbiased estimator of the error variance.\n\nMore precisely, it's as follows:\n\n*   **Residual sum of squares(Residual Sum of Squares, RSS)**: The sum of the squares of the residuals ( $e_{i}$ ).\n    $$\n    RSS=\\sum_{i=1}^{n} e_{i2}=\\sum_{i=1}^{n} \\left(y_{i}-y^​_{i}\\right)^{2}\n    $$\n*   **Residual degrees of freedom(Residual Degrees of Freedom)**: The total number of observations ( $n$ ) minus the number of parameters ( $p$ ) used to fit the model.\n    $$\n    Degrees of freedom=n-p\n    $$\n    Here, $p$ is the number of estimated coefficients including the intercept. For example, in simple linear regression ( $y=\\beta _{0}+\\beta _{1}x+ϵ$ ), we have $p=2$.\n\nTherefore, \\*\\*the unbiased estimator of the error variance ( $\\sigma ^{2}$ ) ( $s^{2}$ )\\*\\* can be written using these two as:\n\n$$\ns^{2}=\\frac{RSS}{n-p}\n$$\n\nThis $s^{2}$ is exactly the residual sum of squares divided by the residual degrees of freedom, and it is an unbiased estimator of $\\sigma ^{2}$. This is also called the \\*\\*mean squared error(Mean Squared Error, MSE)\\*\\*."
    }
  ]
}