[
    {
        "problem_id": 1253,
        "domain": [
            "Mathematics -> Discrete Mathematics -> Combinatorics"
        ],
        "difficulty": 5.25,
        "problem_text": "Find the smallest \\(k\\) such that for any arrangement of 3000 checkers in a \\(2011 \\times 2011\\) checkerboard, with at most one checker in each square, there exist \\(k\\) rows and \\(k\\) columns for which every checker is contained in at least one of these rows or columns.",
        "sample_id": 1,
        "final_answer": "1006",
        "steps": [
            {
                "step_id": 1,
                "edge": "We begin by recalling the fundamental structure of the checkerboard as defined in the problem statement. A square grid of size $n \\times n$ inherently contains exactly $n$ rows and $n$ columns by geometric definition. Here, $n=2011$, so the board has precisely 2011 rows and 2011 columns. This baseline fact establishes the dimensional framework for all subsequent reasoning about row and column selections.",
                "direct_dependent_steps": null,
                "node": "A $2011\\times2011$ checkerboard has $2011$ rows and $2011$ columns."
            },
            {
                "step_id": 2,
                "edge": "This step directly states the given constraint from the problem: 3000 checkers are placed on the board with no more than one checker per square. This numerical input is critical for later calculations involving checker distribution and coverage requirements. The constraint ensures we work within a well-defined combinatorial scenario where checker positions form a partial permutation matrix.",
                "direct_dependent_steps": null,
                "node": "There are $3000$ checkers placed on the board with at most one checker per square."
            },
            {
                "step_id": 3,
                "edge": "Here we formally restate the problem's objective: finding the minimal $k$ such that any checker arrangement admits a covering set of $k$ rows and $k$ columns. This defines our target as a universal guarantee—valid for all possible placements—and sets the stage for both lower-bound construction (via worst-case arrangements) and upper-bound verification (via existence proofs).",
                "direct_dependent_steps": null,
                "node": "We seek the smallest integer $k$ such that for any arrangement of these checkers there exist $k$ rows and $k$ columns covering all checkers."
            },
            {
                "step_id": 4,
                "edge": "To establish a lower bound for $k$, we construct a specific adversarial arrangement: placing one checker on each main diagonal square. This diagonal configuration is chosen because it maximizes the 'spread' of checkers across rows and columns, forcing any covering set to address each checker's unique row-column pairing. Such extremal examples are standard in combinatorial minimax problems to test coverage efficiency.",
                "direct_dependent_steps": null,
                "node": "Consider the arrangement in which one checker occupies each square on the main diagonal."
            },
            {
                "step_id": 5,
                "edge": "Building on Step 1's board dimension, we note that a square matrix of size $2011 \\times 2011$ has exactly 2011 squares on its main diagonal (where row index equals column index). This follows directly from the definition of a diagonal in grid geometry—each index $i$ from 1 to 2011 contributes one diagonal square $(i,i)$. Thus, the diagonal arrangement in Step 4 contains precisely 2011 checkers.",
                "direct_dependent_steps": [
                    1
                ],
                "node": "The main diagonal of a $2011\\times2011$ board contains $2011$ squares."
            },
            {
                "step_id": 6,
                "edge": "Given the diagonal arrangement from Step 4, each row intersects the main diagonal at exactly one square (namely $(i,i)$ for row $i$). Therefore, when we select any set of rows, each selected row covers at most one diagonal checker—specifically the one in its intersection with the diagonal. This constraint arises because no row contains multiple diagonal squares in a standard grid.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "Each selected row intersects the main diagonal in at most one square."
            },
            {
                "step_id": 7,
                "edge": "Similarly, for the diagonal arrangement in Step 4, each column intersects the main diagonal at exactly one square (the square $(j,j)$ for column $j$). Consequently, any selected column covers at most one diagonal checker. This symmetry between rows and columns is inherent to the diagonal structure and will be crucial for bounding total coverage.",
                "direct_dependent_steps": [
                    4
                ],
                "node": "Each selected column intersects the main diagonal in at most one square."
            },
            {
                "step_id": 8,
                "edge": "Combining Step 3's coverage goal with Steps 6 and 7's constraints: selecting $k$ rows covers at most $k$ diagonal checkers (one per row), and $k$ columns covers at most $k$ additional diagonal checkers (one per column). Since coverage is union-based (a checker is covered if in any selected row or column), the total diagonal checkers covered cannot exceed $k + k = 2k$. This upper bound holds regardless of overlap because even if a checker is covered by both a row and column, it still counts as only one covered checker.",
                "direct_dependent_steps": [
                    3,
                    6,
                    7
                ],
                "node": "Therefore selecting $k$ rows and $k$ columns covers at most $2k$ diagonal checkers."
            },
            {
                "step_id": 9,
                "edge": "To cover all diagonal checkers identified in Step 5 (2011 total), the coverage bound from Step 8 requires $2k \\geq 2011$. This inequality is necessary because if $2k < 2011$, Step 8 guarantees insufficient coverage for the diagonal arrangement—violating the universal coverage requirement in Step 3. Thus, any valid $k$ must satisfy this condition to handle this worst-case scenario.",
                "direct_dependent_steps": [
                    5,
                    8
                ],
                "node": "To cover all $2011$ diagonal checkers requires $2k\\ge2011$."
            },
            {
                "step_id": 10,
                "edge": "Solving $2k \\geq 2011$ from Step 9 yields $k \\geq 1005.5$. Since $k$ must be an integer (as it counts rows/columns), we round up to $k \\geq 1006$. Verification: $2 \\times 1005 = 2010 < 2011$ (insufficient), while $2 \\times 1006 = 2012 \\geq 2011$ (sufficient). This establishes 1006 as the minimal integer lower bound for $k$.",
                "direct_dependent_steps": [
                    9
                ],
                "node": "From $2k\\ge2011$ it follows that $k\\ge1006$."
            },
            {
                "step_id": 11,
                "edge": "Having established $k \\geq 1006$ in Step 10, we now prove sufficiency: $k=1006$ works for any arrangement of 3000 checkers. This requires showing that for every possible placement, we can find 1006 rows and 1006 columns covering all checkers. The proof will use a case analysis based on checker distribution in selected rows.",
                "direct_dependent_steps": [
                    10
                ],
                "node": "We now show that $k=1006$ suffices for any arrangement of $3000$ checkers."
            },
            {
                "step_id": 12,
                "edge": "To construct a covering set, we select the 1006 rows with the highest checker counts (using Step 1's 2011 total rows). This greedy approach maximizes coverage from rows alone and minimizes the residual checkers needing column coverage. Step 11's sufficiency goal justifies this strategy, as concentrating coverage in dense rows reduces the column burden.",
                "direct_dependent_steps": [
                    1,
                    11
                ],
                "node": "Among the $2011$ rows, select the $1006$ rows containing the greatest numbers of checkers."
            },
            {
                "step_id": 13,
                "edge": "After selecting the top 1006 rows in Step 12, we relabel the board without loss of generality so these become rows 1–1006. This simplifies notation by fixing the selected rows as the first block, leveraging the problem's symmetry—all row orderings are equivalent for coverage analysis.",
                "direct_dependent_steps": [
                    12
                ],
                "node": "By relabeling rows, assume these selected rows are rows $1$ through $1006$."
            },
            {
                "step_id": 14,
                "edge": "With rows labeled per Step 13, we define $R$ as the total checkers in the remaining rows (1007–2011). Since there are $2011 - 1006 = 1005$ such rows, $R$ quantifies the 'uncovered' checkers if we only use the selected rows. This residual count drives our case analysis for column selection.",
                "direct_dependent_steps": [
                    13
                ],
                "node": "Let $R$ denote the number of checkers in rows $1007$ through $2011$."
            },
            {
                "step_id": 15,
                "edge": "If $R \\leq 1006$ (as defined in Step 14), then at most 1006 checkers lie outside the selected rows (1–1006). This case is favorable because the residual checkers are few enough to be individually covered by columns without exceeding the $k=1006$ column budget.",
                "direct_dependent_steps": [
                    14
                ],
                "node": "If $R\\le1006$, then there are at most $1006$ checkers outside rows $1$ through $1006$."
            },
            {
                "step_id": 16,
                "edge": "Each checker not in rows 1–1006 occupies a unique column by the problem's 'at most one checker per square' rule. Though multiple checkers could share a column, each resides in exactly one column—ensuring we can always assign a column to cover any individual checker.",
                "direct_dependent_steps": null,
                "node": "Each checker outside rows $1$ through $1006$ lies in exactly one column."
            },
            {
                "step_id": 17,
                "edge": "Given Step 15's $R \\leq 1006$ residual checkers and Step 16's column uniqueness, we select one distinct column per residual checker. This covers all residual checkers because each is in its assigned column. While some columns might cover multiple checkers, selecting one per checker (up to $R$ columns) guarantees coverage without exceeding the residual count.",
                "direct_dependent_steps": [
                    15,
                    16
                ],
                "node": "Therefore we can select one column per such checker to cover it individually."
            },
            {
                "step_id": 18,
                "edge": "Step 15 ensures $R \\leq 1006$, and Step 17 uses exactly $R$ columns for residual checkers. Thus, the column count for coverage is at most 1006. This satisfies the column budget constraint for $k=1006$, leaving room to possibly add columns if needed.",
                "direct_dependent_steps": [
                    15,
                    17
                ],
                "node": "That selection uses at most $R\\le1006$ columns."
            },
            {
                "step_id": 19,
                "edge": "Since Step 18 uses at most 1006 columns and Step 11 requires exactly $k=1006$ columns, we add arbitrary columns to reach 1006 total. This adjustment doesn't disrupt coverage (as extra columns don't remove covered checkers) and ensures we meet the exact column count requirement while maintaining residual checker coverage from Step 17.",
                "direct_dependent_steps": [
                    11,
                    18
                ],
                "node": "We can then add arbitrary additional columns to reach exactly $1006$ columns."
            },
            {
                "step_id": 20,
                "edge": "By Step 13's labeling, rows 1–1006 are the selected rows. Thus, any checker in these rows is automatically covered by the row selection—no additional column coverage is needed for them. This is a direct consequence of the row selection strategy in Step 12.",
                "direct_dependent_steps": [
                    13
                ],
                "node": "All checkers in rows $1$ through $1006$ lie in the selected rows."
            },
            {
                "step_id": 21,
                "edge": "Step 17 ensures all residual checkers (outside rows 1–1006) are covered by the columns selected for them, and Step 19 preserves this coverage by only adding columns. Thus, the full column set covers all residual checkers, complementing the row coverage from Step 20.",
                "direct_dependent_steps": [
                    17,
                    19
                ],
                "node": "All checkers outside rows $1$ through $1006$ lie in the selected columns."
            },
            {
                "step_id": 22,
                "edge": "Combining Step 20 (rows 1–1006 cover their checkers) and Step 21 (selected columns cover residual checkers), all checkers are covered by 1006 rows and 1006 columns. This confirms sufficiency for the $R \\leq 1006$ case, fulfilling Step 11's goal for this scenario.",
                "direct_dependent_steps": [
                    20,
                    21
                ],
                "node": "Hence $1006$ rows and $1006$ columns cover all checkers in this case."
            },
            {
                "step_id": 23,
                "edge": "We now consider the alternative case where $R \\geq 1006$ (using Step 14's definition of $R$). This scenario requires a different argument, as the residual checkers exceed the column budget if handled naively. We will show this case leads to a contradiction, proving it cannot occur with 3000 checkers.",
                "direct_dependent_steps": [
                    14
                ],
                "node": "Now suppose instead that $R\\ge1006$."
            },
            {
                "step_id": 24,
                "edge": "With $R \\geq 1006$ residual checkers in Step 23 distributed across 1005 rows (rows 1007–2011 per Step 14), the pigeonhole principle applies: if 1005 rows hold ≥1006 checkers, at least one row must contain ≥2 checkers (since $1005 \\times 1 = 1005 < 1006$). This identifies a dense row in the residual set.",
                "direct_dependent_steps": [
                    14,
                    23
                ],
                "node": "By the pigeonhole principle, among the $1005$ rows $1007$ through $2011$ containing $R\\ge1006$ checkers, at least one row contains at least two checkers."
            },
            {
                "step_id": 25,
                "edge": "Step 12 selected the top 1006 rows by checker count, so each must have at least as many checkers as any row in the residual set (rows 1007–2011). Step 24 shows at least one residual row has ≥2 checkers, implying every selected row has ≥2 checkers. This propagates density from the residual set to the selected rows.",
                "direct_dependent_steps": [
                    12,
                    24
                ],
                "node": "Each of the selected $1006$ rows contains at least as many checkers as any row among rows $1007$ through $2011$ and thus has at least two checkers."
            },
            {
                "step_id": 26,
                "edge": "Since Step 25 guarantees each of the 1006 selected rows has ≥2 checkers, the total in selected rows is ≥$2 \\times 1006 = 2012$. Sanity check: $2 \\times 1006 = 2012$ is exact for minimum density, and 2012 ≤ 3000 (total checkers), so this is plausible without immediate contradiction.",
                "direct_dependent_steps": [
                    25
                ],
                "node": "Hence the total number of checkers in the selected rows is at least $2\\times1006=2012$."
            },
            {
                "step_id": 27,
                "edge": "By Step 14's definition, $R$ is the checker count in rows 1007–2011, and Step 23 assumes $R \\geq 1006$. Thus, the residual rows contain at least 1006 checkers—this is a direct restatement of the case assumption for summation.",
                "direct_dependent_steps": [
                    14
                ],
                "node": "The rows $1007$ through $2011$ contain at least $R\\ge1006$ checkers by definition of $R$."
            },
            {
                "step_id": 28,
                "edge": "Summing Step 26's selected-row minimum (≥2012) and Step 27's residual-row minimum (≥1006) gives total checkers ≥$2012 + 1006 = 3018$. Verification: $2012 + 1006 = 3018$, which exceeds the problem's 3000 checkers.",
                "direct_dependent_steps": [
                    26,
                    27
                ],
                "node": "Therefore the total number of checkers on the board is at least $2012+1006=3018$."
            },
            {
                "step_id": 29,
                "edge": "Step 28's lower bound (≥3018 checkers) contradicts Step 2's fixed total of 3000 checkers. This impossibility means the assumption $R \\geq 1006$ (Step 23) cannot hold for any valid arrangement with 3000 checkers.",
                "direct_dependent_steps": [
                    2,
                    28
                ],
                "node": "This contradicts the total of $3000$ checkers."
            },
            {
                "step_id": 30,
                "edge": "From Step 29's contradiction, the assumption $R \\geq 1006$ must be false. Thus, for all valid arrangements, Step 14's $R$ satisfies $R \\leq 1006$. This reduces all cases to the scenario handled in Steps 15–22, where coverage with $k=1006$ is achievable.",
                "direct_dependent_steps": [
                    29
                ],
                "node": "Therefore the assumption $R\\ge1006$ is false and we must have $R\\le1006$."
            },
            {
                "step_id": 31,
                "edge": "Step 22 confirms sufficiency for $R \\leq 1006$, and Step 30 shows this is the only possible case. Therefore, for every arrangement of 3000 checkers, 1006 rows and 1006 columns suffice to cover all checkers—proving $k=1006$ meets the universal coverage requirement from Step 3.",
                "direct_dependent_steps": [
                    22,
                    30
                ],
                "node": "In all cases, $1006$ rows and $1006$ columns suffice to cover every checker."
            },
            {
                "step_id": 32,
                "edge": "Step 10 established $k \\geq 1006$ as necessary (via the diagonal arrangement), and Step 31 proved $k=1006$ is sufficient. Thus, 1006 is the smallest integer satisfying the problem's condition for all arrangements, confirming the final answer.",
                "direct_dependent_steps": [
                    10,
                    31
                ],
                "node": "The final answer is \\boxed{1006}."
            }
        ]
    }
]
