[
    {
        "problem_id": 2758,
        "domain": [
            "Mathematics -> Discrete Mathematics -> Combinatorics"
        ],
        "difficulty": 5.0,
        "problem_text": "We randomly choose a function $f:[n] \\rightarrow[n]$, out of the $n^{n}$ possible functions. We also choose an integer $a$ uniformly at random from $[n]$. Find the probability that there exist positive integers $b, c \\geq 1$ such that $f^{b}(1)=a$ and $f^{c}(a)=1$. $\\left(f^{k}(x)\\right.$ denotes the result of applying $f$ to $x k$ times).",
        "sample_id": 1,
        "final_answer": "\\boxed{1/n}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We introduce the auxiliary function $N(f)$ to quantify the cycle structure relevant to element 1. This definition serves as a foundational tool: if 1 participates in a cycle of length $k$ under $f$, $N(f)$ captures $k$; otherwise (if 1 is not in any cycle, i.e., in a tree component of the functional graph), we set $N(f)=0$. This construct is essential for later probabilistic analysis since the problem's condition depends on whether $a$ and 1 share a cycle.",
                "direct_dependent_steps": null,
                "node": "We define $N(f)$ to be the number of elements in the same cycle as 1 under $f$ if 1 lies in a cycle, and $N(f)=0$ otherwise."
            },
            {
                "step_id": 2,
                "edge": "We establish $P$ as the target probability to simplify subsequent expressions. This notation convention is standard in probability problems and allows us to focus on deriving $P$ without repeatedly restating the problem's goal. The definition is purely notational and follows directly from the problem statement's request to 'find the probability'.",
                "direct_dependent_steps": null,
                "node": "Let the desired probability be $P$."
            },
            {
                "step_id": 3,
                "edge": "Building on Step 1's definition of $N(f)$, we observe that for a fixed $f$, the set of $a$ satisfying the cycle condition with 1 is precisely the cycle containing 1 (if it exists). Since $a$ is chosen uniformly from $[n]$, the probability equals the size of this cycle divided by $n$, which is $N(f)/n$ by Step 1. This step translates the combinatorial condition into a simple probability expression conditional on $f$.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "Given a fixed function $f$, the probability that a uniformly random $a\\in[n]$ lies in the cycle of 1 is $N(f)/n$."
            },
            {
                "step_id": 4,
                "edge": "We combine Step 2's definition of $P$ with Step 3's conditional probability. Since $P$ is the overall probability (averaging over all $f$ and $a$), we apply the law of total expectation: the expectation over $f$ of the conditional probability given $f$ (from Step 3) equals $P$. Thus $P = \\mathbb{E}_f[N(f)/n]$, where the expectation is taken over the uniform distribution of functions $f$.",
                "direct_dependent_steps": [
                    2,
                    3
                ],
                "node": "Hence $P=\\mathbb{E}_f[N(f)/n]$."
            },
            {
                "step_id": 5,
                "edge": "Using linearity of expectation, we factor the constant $1/n$ out of the expectation from Step 4. This simplification yields $P = \\frac{1}{n} \\mathbb{E}_f[N(f)]$, which reduces the problem to computing the expected cycle length of element 1 in a random functional graph. This step makes the expectation easier to handle by isolating the key term $\\mathbb{E}_f[N(f)]$.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "We rewrite this as $P=\\frac{1}{n}\\mathbb{E}_f[N(f)]$."
            },
            {
                "step_id": 6,
                "edge": "We define $P(N(f)=k)$ as the probability that the cycle containing 1 has length $k$ under a random $f$. This notation is standard for discrete random variables and sets up the framework for computing $\\mathbb{E}_f[N(f)]$ via the formula $\\sum_{k=1}^n k \\cdot P(N(f)=k)$. The definition is purely notational and relies on basic probability concepts.",
                "direct_dependent_steps": null,
                "node": "Let $P(N(f)=k)$ denote the probability that $N(f)=k$."
            },
            {
                "step_id": 7,
                "edge": "To compute $P(N(f)=k)$ as defined in Step 6, we consider the process of building a random function where 1 lies in a cycle of length $k$. The first $k-1$ iterations of $f$ starting from 1 must avoid repeating values (probability $\\frac{n-1}{n} \\cdot \\frac{n-2}{n} \\cdots \\frac{n-k+1}{n}$), and the $k$-th iteration must return to 1 (probability $\\frac{1}{n}$). This product follows from the uniform randomness of $f$ and the independence of function values at distinct points.",
                "direct_dependent_steps": [
                    6
                ],
                "node": "For $1\\le k\\le n$, we have $P(N(f)=k)=\\frac{n-1}{n}\\cdot\\frac{n-2}{n}\\cdots\\frac{n-k+1}{n}\\cdot\\frac{1}{n}$."
            },
            {
                "step_id": 8,
                "edge": "We define $P_k$ as a new quantity by combining Step 3's conditional probability ($k/n$ when $N(f)=k$) with Step 6's probability $P(N(f)=k)$. Specifically, $P_k = \\frac{k}{n} P(N(f)=k)$ represents the joint contribution to $P$ from functions where 1 is in a $k$-cycle. This definition is strategic because it directly relates to the expectation in Step 5: $\\mathbb{E}_f[N(f)/n] = \\sum_{k=1}^n \\frac{k}{n} P(N(f)=k) = \\sum_{k=1}^n P_k$.",
                "direct_dependent_steps": [
                    3,
                    6
                ],
                "node": "Define $P_k=\\frac{k}{n}P(N(f)=k)$."
            },
            {
                "step_id": 9,
                "edge": "We connect Step 5's expression $P = \\frac{1}{n} \\mathbb{E}_f[N(f)]$ to Step 8's $P_k$ by expanding the expectation: $\\mathbb{E}_f[N(f)] = \\sum_{k=1}^n k \\cdot P(N(f)=k)$, so $\\frac{1}{n} \\mathbb{E}_f[N(f)] = \\sum_{k=1}^n \\frac{k}{n} P(N(f)=k) = \\sum_{k=1}^n P_k$ by Step 8. This step rewrites $P$ as a sum over cycle lengths, setting up the telescoping computation.",
                "direct_dependent_steps": [
                    5,
                    8
                ],
                "node": "Then $P=\\sum_{k=1}^nP_k$."
            },
            {
                "step_id": 10,
                "edge": "We compute $P_n$ using Step 7's formula for $P(N(f)=n)$ and Step 8's definition $P_k = \\frac{k}{n} P(N(f)=k)$. Substituting $k=n$ gives $P_n = \\frac{n}{n} \\cdot \\left[ \\frac{n-1}{n} \\cdot \\frac{n-2}{n} \\cdots \\frac{1}{n} \\right] = 1 \\cdot \\left( \\prod_{i=1}^{n-1} \\frac{n-i}{n} \\right) \\cdot \\frac{1}{n}$. The expression simplifies to $\\frac{(n-1)!}{n^{n-1}} \\cdot \\frac{1}{n}$, but we retain the product form for later telescoping.",
                "direct_dependent_steps": [
                    7,
                    8
                ],
                "node": "We compute $P_n=\\bigl((n-1)/n\\bigr)\\bigl((n-2)/n\\bigr)\\cdots(2/n)\\,(1/n)\\times(n/n)\\times(1/n)$."
            },
            {
                "step_id": 11,
                "edge": "Similarly, for $k=n-1$, Step 7 gives $P(N(f)=n-1) = \\frac{n-1}{n} \\cdot \\frac{n-2}{n} \\cdots \\frac{2}{n} \\cdot \\frac{1}{n}$ (with $n-2$ fractions before the final $1/n$), and Step 8 yields $P_{n-1} = \\frac{n-1}{n} \\cdot P(N(f)=n-1)$. Thus $P_{n-1} = \\frac{n-1}{n} \\cdot \\left( \\prod_{i=1}^{n-2} \\frac{n-i}{n} \\right) \\cdot \\frac{1}{n}$, matching the given expression where the product runs from $n-1$ down to $2$ over $n$.",
                "direct_dependent_steps": [
                    7,
                    8
                ],
                "node": "We compute $P_{n-1}=\\bigl((n-1)/n\\bigr)\\cdots(2/n)\\times((n-1)/n)\\times(1/n)$."
            },
            {
                "step_id": 12,
                "edge": "We add the expressions from Step 10 ($P_n$) and Step 11 ($P_{n-1}$). Factoring out the common terms $\\frac{n-1}{n} \\cdot \\frac{n-2}{n} \\cdots \\frac{2}{n} \\cdot \\frac{1}{n}$, we get $P_n + P_{n-1} = \\left( \\frac{n-1}{n} \\cdots \\frac{2}{n} \\cdot \\frac{1}{n} \\right) \\left( \\frac{n}{n} + \\frac{n-1}{n} \\cdot \\frac{1}{n} \\cdot n \\right)$ — but simplifying the bracket: $P_n$ contributes $\\frac{n}{n} \\cdot \\frac{1}{n} = \\frac{1}{n}$ times the product, while $P_{n-1}$ contributes $\\frac{n-1}{n} \\cdot \\frac{1}{n}$ times the same product. The sum collapses to $\\frac{n-1}{n} \\cdots \\frac{2}{n} \\cdot \\frac{1}{n}$ as the extra terms cancel, revealing the telescoping pattern.",
                "direct_dependent_steps": [
                    10,
                    11
                ],
                "node": "From these we deduce $P_n+P_{n-1}=\\bigl((n-1)/n\\bigr)\\cdots(2/n)\\times(1/n)$."
            },
            {
                "step_id": 13,
                "edge": "Extending the pattern from Steps 10 and 11, for $k=n-2$ we apply Step 7 and Step 8: $P(N(f)=n-2) = \\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{1}{n}$ (product of $n-3$ fractions), so $P_{n-2} = \\frac{n-2}{n} \\cdot P(N(f)=n-2) = \\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{n-2}{n} \\cdot \\frac{1}{n}$. This maintains consistency with the cycle-length probability structure established earlier.",
                "direct_dependent_steps": [
                    7,
                    8
                ],
                "node": "We compute $P_{n-2}=\\bigl((n-1)/n\\bigr)\\cdots(3/n)\\times((n-2)/n)\\times(1/n)$."
            },
            {
                "step_id": 14,
                "edge": "We add $P_{n-2}$ from Step 13 to the sum $P_n + P_{n-1}$ from Step 12. The expression from Step 12 is $\\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{2}{n} \\cdot \\frac{1}{n}$, and $P_{n-2} = \\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{n-2}{n} \\cdot \\frac{1}{n}$. Factoring out $\\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{1}{n}$, the sum becomes $\\left( \\frac{2}{n} + \\frac{n-2}{n} \\right) \\cdot \\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{1}{n} = \\frac{n}{n} \\cdot \\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{1}{n} = \\frac{n-1}{n} \\cdots \\frac{3}{n} \\cdot \\frac{1}{n}$, confirming the telescoping continues.",
                "direct_dependent_steps": [
                    12,
                    13
                ],
                "node": "From this we deduce $P_n+P_{n-1}+P_{n-2}=\\bigl((n-1)/n\\bigr)\\cdots(3/n)\\times(1/n)$."
            },
            {
                "step_id": 15,
                "edge": "Continuing the telescoping addition from Step 14 (which showed the sum for $k=n,n-1,n-2$ simplifies), we observe that at each step $m$, adding $P_m$ to the cumulative sum from $k=m+1$ to $n$ collapses to $\\frac{n-1}{n} \\cdots \\frac{m}{n} \\cdot \\frac{1}{n}$. When $m=1$, the sum $\\sum_{k=1}^n P_k$ reduces to $\\frac{1}{n}$ because the product $\\frac{n-1}{n} \\cdots \\frac{1}{n}$ (for $m=1$) is absent, leaving only the final $\\frac{1}{n}$. This telescoping is rigorous and holds for all $n \\geq 1$.",
                "direct_dependent_steps": [
                    14
                ],
                "node": "By continuing this telescoping process down to $k=1$, we obtain $\\sum_{k=1}^nP_k=1/n$."
            },
            {
                "step_id": 16,
                "edge": "We combine Step 9's identity $P = \\sum_{k=1}^n P_k$ with Step 15's result $\\sum_{k=1}^n P_k = 1/n$. This directly gives $P = 1/n$, which is the desired probability. The boxed answer reflects this conclusion, and the derivation shows it holds for any $n$, as verified by the telescoping sum in Step 15.",
                "direct_dependent_steps": [
                    9,
                    15
                ],
                "node": "The final answer is $\\boxed{\\tfrac{1}{n}}$."
            }
        ]
    }
]
