[
    {
        "problem_id": 1672,
        "domain": [
            "Mathematics -> Applied Mathematics -> Statistics -> Mathematical Statistics"
        ],
        "difficulty": 6.0,
        "problem_text": "Let $X_{1}, \\cdots, X_{n}$ be $n$ independent and identically distributed observations from the Cauchy distribution with density function $f(x)=\\frac{1}{\\pi} \\frac{1}{1+(x-\\theta)^{2}}, x \\in \\mathbb{R}$. b) Can you find an unbiased estimator $T$ that attains the lower bound in part a)? If yes, please construct one. If no, please show why such an estimator does not exist.",
        "sample_id": 1,
        "final_answer": "No such unbiased estimator exists",
        "steps": [
            {
                "step_id": 1,
                "edge": "This step establishes the foundational setup of the problem, directly stated in the problem text. The independence and identical distribution (i.i.d.) of the random variables $X_1, \\dots, X_n$ is a critical assumption for constructing the joint likelihood function later, as it allows decomposition of the joint density into a product of individual densities. This property underpins the subsequent summation of log-likelihood contributions across observations.",
                "direct_dependent_steps": null,
                "node": "The random variables $X_1,\\dots,X_n$ are independent and identically distributed."
            },
            {
                "step_id": 2,
                "edge": "The problem explicitly specifies the Cauchy density function for each observation, which serves as the parametric model for inference. This density $f(x) = \\frac{1}{\\pi} \\frac{1}{1+(x-\\theta)^2}$ centers at the unknown location parameter $\\theta$, and its heavy-tailed nature (lack of finite moments) will later explain why the Cramér-Rao bound is unattainable. This definition is essential for deriving likelihood-based quantities in subsequent steps.",
                "direct_dependent_steps": null,
                "node": "Each observation $X_i$ has the Cauchy density $f(x)=\\frac1\\pi\\frac1{1+(x-\\theta)^2}$."
            },
            {
                "step_id": 3,
                "edge": "Building on the density function from Step 2, we compute the log-likelihood contribution for a single observation by applying the natural logarithm to $f(X_i)$. This transformation simplifies differentiation and summation operations: $\\log f(X_i) = \\log \\left( \\frac{1}{\\pi} \\frac{1}{1+(X_i-\\theta)^2} \\right) = -\\log \\pi - \\log\\bigl(1+(X_i-\\theta)^2\\bigr)$. The log converts multiplicative relationships into additive ones, facilitating the derivation of the score function in later steps.",
                "direct_dependent_steps": [
                    2
                ],
                "node": "The log-likelihood contribution for a single observation is $\\ell_i(\\theta)=\\log f(X_i)= -\\log\\pi-\\log\\bigl(1+(X_i-\\theta)^2\\bigr)$."
            },
            {
                "step_id": 4,
                "edge": "Leveraging the i.i.d. assumption from Step 1 and the per-observation log-likelihood from Step 3, we construct the full-sample log-likelihood by summing individual contributions. Independence implies the joint density is the product of marginals, so the logarithm converts this product into a sum: $\\ell(\\theta) = \\sum_{i=1}^n \\ell_i(\\theta)$. This additive structure is crucial for efficiently computing derivatives and expectations across all observations.",
                "direct_dependent_steps": [
                    1,
                    3
                ],
                "node": "The log-likelihood for the full sample is $\\ell(\\theta)=\\sum_{i=1}^n\\ell_i(\\theta)$."
            },
            {
                "step_id": 5,
                "edge": "The score function is defined as the derivative of the log-likelihood with respect to $\\theta$, following standard statistical theory for parameter estimation. As established in Step 4, $\\ell(\\theta)$ is the total log-likelihood, so $\\ell'(\\theta) = \\frac{\\partial}{\\partial \\theta} \\ell(\\theta)$ quantifies the sensitivity of the likelihood to changes in $\\theta$. This derivative is central to the Cramér-Rao bound derivation, as it relates to the Fisher information.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "The score function is defined by $\\ell'(\\theta)=\\frac{\\partial}{\\partial\\theta}\\ell(\\theta)$."
            },
            {
                "step_id": 6,
                "edge": "To compute the per-observation derivative required for the score function, we differentiate the log-likelihood contribution from Step 3 with respect to $\\theta$. Applying the chain rule to $-\\log\\bigl(1+(X_i-\\theta)^2\\bigr)$: the derivative of the outer log function is $\\frac{1}{1+(X_i-\\theta)^2}$, multiplied by the derivative of the inner function $-2(X_i-\\theta) \\cdot (-1) = 2(X_i-\\theta)$. This yields $\\frac{2(X_i-\\theta)}{1+(X_i-\\theta)^2}$, a key component for aggregating the full score.",
                "direct_dependent_steps": [
                    3
                ],
                "node": "The derivative $\\frac{\\partial}{\\partial\\theta}\\bigl(-\\log(1+(X_i-\\theta)^2)\\bigr)$ equals $\\frac{2(X_i-\\theta)}{1+(X_i-\\theta)^2}$."
            },
            {
                "step_id": 7,
                "edge": "Combining the score function definition from Step 5 with the per-observation derivative computed in Step 6, we sum the individual derivatives across all $n$ observations. Since $\\ell(\\theta)$ is additive (Step 4), differentiation distributes over the sum, resulting in $\\ell'(\\theta) = \\sum_{i=1}^n \\frac{2(X_i-\\theta)}{1+(X_i-\\theta)^2}$. This expression captures the total sensitivity of the likelihood to $\\theta$ and will be used to compute the Fisher information.",
                "direct_dependent_steps": [
                    5,
                    6
                ],
                "node": "Therefore the score function is $\\ell'(\\theta)=\\sum_{i=1}^n\\frac{2(X_i-\\theta)}{1+(X_i-\\theta)^2}$."
            },
            {
                "step_id": 8,
                "edge": "The Cramér-Rao inequality provides a fundamental lower bound for the variance of unbiased estimators. As defined in Step 5, $\\ell'(\\theta)$ is the score function, and the Fisher information is $E[\\ell'(\\theta)^2]$. For any unbiased estimator $T$ of $\\theta$, the inequality states $\\mathrm{Var}(T) \\geq 1/E[\\ell'(\\theta)^2]$. This bound is derived from the Cauchy-Schwarz inequality applied to the covariance between $T$ and the score, setting the stage for analyzing attainability.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "By the Cramér–Rao inequality any unbiased estimator $T$ of $\\theta$ satisfies $\\mathrm{Var}(T)\\ge1/E[\\ell'(\\theta)^2]$."
            },
            {
                "step_id": 9,
                "edge": "Expanding on the Cramér-Rao framework in Step 8, the proof relies on the Cauchy-Schwarz inequality applied to the random variables $\\ell'(\\theta)$ and $T - \\theta$. This gives $(E[\\ell'(\\theta)(T - \\theta)])^2 \\leq E[\\ell'(\\theta)^2] E[(T - \\theta)^2]$. The left-hand side simplifies under unbiasedness (as shown later), while the right-hand side links the Fisher information to $\\mathrm{Var}(T)$. Equality in this inequality is necessary for attaining the Cramér-Rao bound.",
                "direct_dependent_steps": [
                    8
                ],
                "node": "The Cauchy–Schwarz inequality in the Cramér–Rao proof gives $(E[\\ell'(\\theta)(T-\\theta)])^2\\le E[\\ell'(\\theta)^2]E[(T-\\theta)^2]$."
            },
            {
                "step_id": 10,
                "edge": "This step states a basic definition: an unbiased estimator $T$ of $\\theta$ satisfies $E[T] = \\theta$, which immediately implies $E[T - \\theta] = 0$. This zero-mean property of the estimation error is a prerequisite for applying the Cramér-Rao inequality and will be differentiated later to establish a key identity involving the score function.",
                "direct_dependent_steps": null,
                "node": "Unbiasedness of $T$ implies $E[T-\\theta]=0$."
            },
            {
                "step_id": 11,
                "edge": "Differentiating the unbiasedness condition $E[T - \\theta] = 0$ (Step 10) with respect to $\\theta$ under the expectation operator—a standard operation in regularity conditions—yields $E[\\ell'(\\theta)(T - \\theta)] = 1$. This follows from Leibniz's rule: the derivative of $E[T - \\theta]$ is $E[\\frac{\\partial}{\\partial \\theta} \\log f(X) \\cdot (T - \\theta)] + E[\\frac{\\partial}{\\partial \\theta}(T - \\theta)]$, where $\\frac{\\partial}{\\partial \\theta} \\log f(X) = \\ell'(\\theta)$ from Step 5 and $\\frac{\\partial}{\\partial \\theta}(T - \\theta) = -1$. The result $E[\\ell'(\\theta)(T - \\theta)] = 1$ quantifies the essential covariance for the bound.",
                "direct_dependent_steps": [
                    5,
                    10
                ],
                "node": "Differentiating the unbiasedness condition yields $E[\\ell'(\\theta)(T-\\theta)]=1$."
            },
            {
                "step_id": 12,
                "edge": "The Cauchy-Schwarz inequality in Step 9 achieves equality if and only if $T - \\theta$ is linearly dependent on $\\ell'(\\theta)$. Given from Step 11 that $E[\\ell'(\\theta)(T - \\theta)] = 1$, this proportionality condition requires $T - \\theta = c \\, \\ell'(\\theta)$ almost surely for some constant $c$. This characterization identifies the exact form any estimator must take to attain the Cramér-Rao bound, making it a critical bridge to the final conclusion.",
                "direct_dependent_steps": [
                    9,
                    11
                ],
                "node": "Equality in Cauchy–Schwarz holds if and only if $T-\\theta=c\\,\\ell'(\\theta)$ almost surely for some constant $c$."
            },
            {
                "step_id": 13,
                "edge": "Substituting the explicit score function from Step 7 into the equality condition $T - \\theta = c \\, \\ell'(\\theta)$ derived in Step 12, we obtain $T = \\theta + c \\sum_{i=1}^n \\frac{2(X_i - \\theta)}{1 + (X_i - \\theta)^2}$. This parametrizes all potential estimators that could achieve the Cramér-Rao bound, revealing that $T$ must depend on $\\theta$ through both the additive term and the summation. Such dependence violates the definition of a valid estimator, as shown next.",
                "direct_dependent_steps": [
                    7,
                    12
                ],
                "node": "Hence any estimator attaining the bound must satisfy $T=\\theta+c\\sum_{i=1}^n\\frac{2(X_i-\\theta)}{1+(X_i-\\theta)^2}$."
            },
            {
                "step_id": 14,
                "edge": "From the expression in Step 13, $T$ explicitly contains the unknown parameter $\\theta$ in multiple places: as a standalone additive term and within each $(X_i - \\theta)$ in the sum. This direct dependence on $\\theta$ means $T$ cannot be computed from observed data alone, as $\\theta$ is precisely what we aim to estimate. Consequently, no estimator of this form qualifies as a legitimate statistic, which must be a function of the sample $X_1, \\dots, X_n$ only.",
                "direct_dependent_steps": [
                    13
                ],
                "node": "This expression for $T$ explicitly involves the unknown parameter $\\theta$."
            },
            {
                "step_id": 15,
                "edge": "By definition, a statistical estimator must be a measurable function of the observed data, independent of any unknown parameters. Step 14 demonstrates that any candidate estimator achieving the Cramér-Rao bound inherently requires knowledge of $\\theta$, rendering it non-computable in practice. This contradiction—between the bound-attaining form and the definition of an estimator—proves such estimators are invalid, regardless of the constant $c$.",
                "direct_dependent_steps": [
                    14
                ],
                "node": "An estimator cannot depend on the unknown parameter $\\theta$."
            },
            {
                "step_id": 16,
                "edge": "Combining the impossibility in Step 14 (bound-attaining estimators require $\\theta$) with Step 15's requirement that estimators cannot depend on $\\theta$, we conclude no unbiased estimator can satisfy both the Cramér-Rao equality condition and the definition of an estimator. The Cauchy distribution's lack of finite variance (implied by its heavy tails) exacerbates this, but the core issue is the parameter dependence in the optimal form. Thus, the lower bound is unattainable for unbiased estimators of $\\theta$.",
                "direct_dependent_steps": [
                    14,
                    15
                ],
                "node": "Therefore no unbiased estimator attains the Cramér–Rao lower bound."
            },
            {
                "step_id": 17,
                "edge": "The conclusion from Step 16 directly answers the problem's query: since no unbiased estimator can attain the Cramér-Rao lower bound due to the inescapable dependence on $\\theta$, the final answer is unequivocally that no such estimator exists. This boxed result synthesizes the theoretical chain from the score function's structure to the definition of valid estimation.",
                "direct_dependent_steps": [
                    16
                ],
                "node": "The final answer is $\\boxed{\\text{No such unbiased estimator exists}}$."
            }
        ]
    }
]
