[
    {
        "problem_id": 3162,
        "domain": [
            "Mathematics -> Number Theory -> Congruences"
        ],
        "difficulty": 6.0,
        "problem_text": "For distinct positive integers $a, b<2012$, define $f(a, b)$ to be the number of integers $k$ with $1\\le k<2012$ such that the remainder when $ak$ divided by $2012$ is greater than that of $bk$ divided by $2012$. Let $S$ be the minimum value of $f(a, b)$, where $a$ and $b$ range over all pairs of distinct positive integers less than $2012$. Determine $S$.",
        "sample_id": 1,
        "final_answer": "The final answer is \\boxed{502}",
        "steps": [
            {
                "step_id": 1,
                "edge": "We introduce the abbreviation $N=2012$ to simplify subsequent expressions and avoid repetitive writing of the large constant. This substitution is purely notational and leverages standard mathematical practice for handling fixed parameters in modular arithmetic problems.",
                "direct_dependent_steps": null,
                "node": "Let $N=2012$."
            },
            {
                "step_id": 2,
                "edge": "Using the definition $N=2012$ from Step 1, we factorize $N$ into its prime components. Since $2012 \\div 4 = 503$ and $503$ is prime, we obtain $2012 = 4 \\times 503$. This factorization is critical for later analysis of divisors and gcd computations, as the structure of $N$'s divisors depends entirely on its prime decomposition.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "The prime factorization of $N$ is $2012 = 4 \\times 503$."
            },
            {
                "step_id": 3,
                "edge": "We restate the problem's definition of $f(a,b)$ using the abbreviated $N$ from Step 1 for conciseness. Specifically, $f(a,b)$ counts integers $k$ in $[1, N-1]$ where the remainder of $ak$ modulo $N$ exceeds that of $bk$ modulo $N$. This restatement preserves the original meaning while streamlining notation for modular comparisons.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "Define $f(a,b)$ as the number of integers $k$ with $1\\le k< N$ for which $(ak \\bmod N) > (bk \\bmod N)$."
            },
            {
                "step_id": 4,
                "edge": "To analyze the relationship between $ak \\bmod N$ and $bk \\bmod N$, we define $d \\equiv a-b \\pmod{N}$ using the modulus $N$ from Step 1. This difference $d$ captures the essential discrepancy between $a$ and $b$ modulo $N$, simplifying the comparison of linear residues to a single parameter $d$.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "For any pair $(a,b)$ let $d \\equiv a-b \\pmod{N}$."
            },
            {
                "step_id": 5,
                "edge": "Building on the definition of $d$ from Step 4, we formalize the remainders $r_a(k) = ak \\bmod N$ and $r_b(k) = bk \\bmod N$. Their difference satisfies $r_a(k) - r_b(k) \\equiv (a-b)k \\equiv dk \\pmod{N}$ by modular arithmetic properties. This congruence links the remainder comparison directly to the parameter $d$ and $k$, forming the foundation for tie analysis.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "For each $k$ set $r_a(k)= ak \\bmod N$ and $r_b(k)= bk \\bmod N$ so that $r_a(k)-r_b(k) \\equiv d k \\pmod{N}$."
            },
            {
                "step_id": 6,
                "edge": "From Step 5, $r_a(k) - r_b(k) \\equiv dk \\pmod{N}$. A tie $r_a(k) = r_b(k)$ occurs precisely when this difference is $0$ modulo $N$, i.e., $dk \\equiv 0 \\pmod{N}$. This equivalence follows because remainders in $[0, N-1]$ are equal if and only if their difference is congruent to $0$ modulo $N$.",
                "direct_dependent_steps": [
                    5
                ],
                "node": "A tie $r_a(k)=r_b(k)$ occurs exactly when $d k \\equiv 0 \\pmod{N}$."
            },
            {
                "step_id": 7,
                "edge": "To characterize solutions to $dk \\equiv 0 \\pmod{N}$, we define $g = \\gcd(d, N)$ using the modulus $N$ from Step 1. This greatest common divisor determines the number of solutions to the congruence, as standard in linear congruence theory where the solution count depends on $\\gcd(d, N)$.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "Let $g=\\gcd(d,N)$."
            },
            {
                "step_id": 8,
                "edge": "Combining Step 6 (ties require $dk \\equiv 0 \\pmod{N}$) and Step 7 ($g = \\gcd(d, N)$), we recall that the congruence $dk \\equiv 0 \\pmod{N}$ has exactly $g$ solutions modulo $N$, including $k=0$. Since $k \\in \\{1, \\dots, N-1\\}$ excludes $k=0$, the tie count is $g-1$. This follows from the general solution $k = t \\cdot (N/g)$ for $t=0,1,\\dots,g-1$, where $t=0$ gives $k=0$ (excluded).",
                "direct_dependent_steps": [
                    6,
                    7
                ],
                "node": "The number of integers $k$ in $\\{1,\\dots,N-1\\}$ satisfying $d k \\equiv 0 \\pmod{N}$ is $g-1$."
            },
            {
                "step_id": 9,
                "edge": "Using Step 8's tie count ($g-1$) and the total $N-1$ values of $k$, the non-tied count is $(N-1) - (g-1) = N - g$. This arithmetic simplification is straightforward: subtracting the number of ties from the total $k$-values isolates the cases where $r_a(k) \\neq r_b(k)$, which is essential for analyzing $f(a,b)$.",
                "direct_dependent_steps": [
                    8
                ],
                "node": "Hence the number of non-tied $k$ is $(N-1)-(g-1)=N-g$."
            },
            {
                "step_id": 10,
                "edge": "For non-tied $k$ (count $N-g$ from Step 9), exactly one inequality holds: $r_a(k) > r_b(k)$ (counted in $f(a,b)$ from Step 3) or $r_a(k) < r_b(k)$ (counted in $f(b,a)$). Thus $f(a,b) + f(b,a) = N - g$. This partitioning is exhaustive and mutually exclusive by the trichotomy of real numbers, applied to the integer remainders.",
                "direct_dependent_steps": [
                    3,
                    9
                ],
                "node": "For non-tied $k$ exactly one of $r_a(k)>r_b(k)$ or $r_a(k)<r_b(k)$ holds so that $f(a,b)+f(b,a)=N-g$."
            },
            {
                "step_id": 11,
                "edge": "From Step 10, $f(a,b) + f(b,a) = N - g$. To minimize $f(a,b)$, we must maximize $g$ (subject to $g < N$ since $a \\neq b$ implies $d \\not\\equiv 0 \\pmod{N}$). This is because a larger $g$ reduces $N - g$, the total non-tied pairs, thereby constraining the maximum possible value of the smaller of $f(a,b)$ and $f(b,a)$. Minimizing $f(a,b)$ thus requires maximizing $g$ over valid $d = a-b$.",
                "direct_dependent_steps": [
                    10
                ],
                "node": "Minimizing $f(a,b)$ requires maximizing $g$ subject to $g<N$."
            },
            {
                "step_id": 12,
                "edge": "Referencing Step 2's factorization ($N = 4 \\times 503$) and Step 11's requirement to maximize $g < N$, we identify the maximal proper divisor of $N$. The divisors of $2012$ are $1, 2, 4, 503, 1006, 2012$, so the largest proper divisor is $N/2 = 1006$. Hence the maximal $g < N$ is $1006$, achievable when $d$ is a multiple of $1006$ but not $2012$.",
                "direct_dependent_steps": [
                    2,
                    11
                ],
                "node": "The maximal proper divisor of $N=2012$ is $N/2=1006$ so the maximal $g<N$ equals $1006$."
            },
            {
                "step_id": 13,
                "edge": "To achieve $g = 1006$ as established in Step 12, we set $d \\equiv 1006 \\pmod{2012}$. This choice ensures $\\gcd(d, N) = \\gcd(1006, 2012) = 1006$ since $2012 = 2 \\times 1006$, confirming $g = 1006$ as required for maximal $g$.",
                "direct_dependent_steps": [
                    12
                ],
                "node": "Thus we set $d\\equiv1006 \\pmod{2012}$ which gives $g=1006$."
            },
            {
                "step_id": 14,
                "edge": "With $d = 1006$ (Step 13) and ties defined by $dk \\equiv 0 \\pmod{N}$ (Step 6), we solve $1006k \\equiv 0 \\pmod{2012}$. Dividing by $1006$ yields $k \\equiv 0 \\pmod{2}$, so ties occur iff $k$ is even. This follows because $1006k = 2012m$ implies $k = 2m$, confirming even $k$ are the only solutions in $\\{1, \\dots, N-1\\}$.",
                "direct_dependent_steps": [
                    6,
                    13
                ],
                "node": "For $d=1006$ the ties occur exactly for even $k$."
            },
            {
                "step_id": 15,
                "edge": "Combining Step 9 ($N - g = 2012 - 1006 = 1006$ non-tied $k$), Step 13 ($g = 1006$), and Step 14 (ties for even $k$), the non-tied $k$ must be odd. In $\\{1, \\dots, 2011\\}$, there are exactly $1006$ odd integers (e.g., $1, 3, \\dots, 2011$), matching $N - g$. This count is verified by $(2011 + 1)/2 = 1006$.",
                "direct_dependent_steps": [
                    9,
                    13,
                    14
                ],
                "node": "Therefore the non-tied $k$ are exactly the odd integers $1\\le k\\le N-1$, whose count is $1006$."
            },
            {
                "step_id": 16,
                "edge": "For odd $k$ (non-tied per Step 14 and Step 15), Step 5 gives $r_a(k) - r_b(k) \\equiv dk \\equiv 1006k \\pmod{2012}$. Since $k$ is odd, $1006k \\equiv 1006 \\pmod{2012}$ (as $1006 \\times 2 \\equiv 0$). Thus $r_a(k) = (r_b(k) + 1006) \\bmod 2012$, which exceeds $r_b(k)$ iff $r_b(k) < 1006$, i.e., $r_b(k) \\leq 1005$. This equivalence follows from modular reduction: if $r_b(k) \\leq 1005$, $r_a(k) = r_b(k) + 1006 > r_b(k)$; if $r_b(k) \\geq 1006$, $r_a(k) = r_b(k) - 1006 < r_b(k)$.",
                "direct_dependent_steps": [
                    5,
                    13,
                    14
                ],
                "node": "For each odd $k$ one has $r_a(k)>r_b(k)$ if and only if $(b k \\bmod N)\\le1005$."
            },
            {
                "step_id": 17,
                "edge": "From Step 16, $f(a,b)$ equals the count of odd $k$ where $bk \\bmod N \\leq 1005$. To minimize $f(a,b)$, we must minimize this count over valid $b$. This rephrasing focuses the optimization on selecting $b$ to reduce the number of favorable residues for the inequality.",
                "direct_dependent_steps": [
                    16
                ],
                "node": "To minimize $f$ we choose $b$ so as to minimize the number of odd $k$ with $b k\\bmod N\\le1005$."
            },
            {
                "step_id": 18,
                "edge": "To minimize the count in Step 17, we choose $b$ with $\\gcd(b, N) = 2$. This choice is strategic: it ensures the residues $bk \\bmod N$ for odd $k$ distribute evenly across specific residue classes (as justified in Step 19), enabling precise control over how many fall below $1006$.",
                "direct_dependent_steps": [
                    17
                ],
                "node": "A convenient choice is any $b$ with $\\gcd(b,N)=2$."
            },
            {
                "step_id": 19,
                "edge": "Given $\\gcd(b, N) = 2$ (Step 18) and $N = 4 \\times 503$ (Step 2), write $b = 2b_1$ with $\\gcd(b_1, 1006) = 1$ (since $N/2 = 1006$). For odd $k$, $bk = 2b_1k$. As $k$ runs over odd residues mod $N$, $b_1k$ mod $1006$ runs over all odd residues mod $1006$ (due to $\\gcd(b_1, 1006)=1$), so $2b_1k$ mod $2012$ runs over all residues $\\equiv 2 \\pmod{4}$. This follows because odd $b_1k$ implies $2 \\times \\text{odd} \\equiv 2 \\pmod{4}$, and the bijection ensures coverage of all such residues.",
                "direct_dependent_steps": [
                    2,
                    18
                ],
                "node": "If $\\gcd(b,N)=2$ then for odd $k$ the residues $b k\\bmod N$ run precisely over the numbers congruent to $2\\pmod4$."
            },
            {
                "step_id": 20,
                "edge": "From Step 2 ($N = 2012$) and Step 19 (residues $\\equiv 2 \\pmod{4}$), we count residues in $\\{0, \\dots, 2011\\}$ congruent to $2$ mod $4$: $2, 6, 10, \\dots, 2010$. This is an arithmetic sequence with difference $4$, so the count is $((2010 - 2)/4) + 1 = 503$. Sanity check: total residues $2012$, quartered by mod $4$ classes, but $2012 \\div 4 = 503$, so each residue class mod $4$ has exactly $503$ elements.",
                "direct_dependent_steps": [
                    2,
                    19
                ],
                "node": "There are exactly $503$ residues in $\\{0,1,\\dots,N-1\\}$ that are congruent to $2\\pmod4$."
            },
            {
                "step_id": 21,
                "edge": "Using Step 20's $503$ residues $\\equiv 2 \\pmod{4}$, we isolate those $\\leq 1005$: $2, 6, \\dots, 1002$. This sequence has first term $2$, common difference $4$, last term $1002$, so the count is $((1002 - 2)/4) + 1 = 251$. Verification: $1002 = 2 + 4(n-1)$ implies $n = 251$, and $4 \\times 250 = 1000$, so $2 + 1000 = 1002$ is the $251$st term.",
                "direct_dependent_steps": [
                    20
                ],
                "node": "Among these residues those at most $1005$ are $2,6,10,\\dots,1002$, of which there are $251$."
            },
            {
                "step_id": 22,
                "edge": "From Step 15 ($1006$ odd $k$) and Step 19 (residues $bk \\bmod N$ cover $503$ distinct values $\\equiv 2 \\pmod{4}$), each residue arises from exactly $1006 / 503 = 2$ values of $k$. This uniform distribution follows because the map $k \\mapsto bk \\bmod N$ is $2$-to-$1$ over the residue set when $\\gcd(b, N) = 2$, as established by the Chinese Remainder Theorem decomposition in Step 19.",
                "direct_dependent_steps": [
                    15,
                    19
                ],
                "node": "Each such residue arises from exactly two odd values of $k$ modulo $N$ when $\\gcd(b,N)=2$."
            },
            {
                "step_id": 23,
                "edge": "Combining Step 21 ($251$ residues $\\leq 1005$) and Step 22 (each residue from two $k$), the count of odd $k$ with $bk \\bmod N \\leq 1005$ is $2 \\times 251 = 502$. By Step 16, this equals $f(a,b)$ for the chosen $a, b$. This value is achievable (e.g., with $b=2$, $a = b + 1006 = 1008$), and since Step 11 shows $g=1006$ maximizes $g$, $502$ is the minimal possible $f(a,b)$.",
                "direct_dependent_steps": [
                    21,
                    22
                ],
                "node": "Consequently $f(a,b)=2\\times251=502$ for this choice of $a$ and $b$."
            }
        ]
    }
]
