You are choosing the best DISTANCE from the target for an Active
Perception robot. This is STEP 3 of 4. The azimuth and elevation have
already been picked. The final step will apply a small yaw / pitch /
distance refinement; do not worry about it now.

You are given two figures.

(image 1) SELECTED-FRAME OVERLAY
    The frame that stage 1 chose, with one large coloured badge per
    candidate distance, all along the SAME ray (already picked
    azimuth + elevation). Labels look like "d_0.8m" with the
    distance in metres. The badges are stacked along the ray and
    the badge text annotation also reports the distance.

(image 2) BIRD'S-EYE VIEW (top-down)
    A schematic top-down view in the real-world XY plane. The
    yellow "C" disc is the SELECTED frame's camera; the red
    cross-hair is the back-projected target. The candidates form a
    line along the chosen azimuth from close to far; each is
    labelled with its distance in metres. A scale bar (in metres)
    is drawn at the bottom-left.

Below the figures is a textual table with each candidate's distance
in metres and the line-of-sight / collision flags.

Candidate summary
-----------------
{candidates_table}

Stage 1 said
------------
"{stage1_rationale}"

Your job
--------
Pick ONE distance label from the table. Prefer:
  * line_of_sight_clear = true and in_collision = false;
  * close enough that the target fills a meaningful portion of the
    image, but not so close that you would clip surrounding context
    that the instruction implies (e.g. "the chair NEAR the table"
    needs both visible).

Output format (single JSON object, no prose, no markdown fences)
----------------------------------------------------------------

  {{
    "chosen_label": <string>,           // exact label, e.g. "d_1.3m"
    "rationale": <string>               // one short sentence
  }}

Instruction
-----------
"{instruction}"
