# Stimuli

This folder contains all experimental stimuli: the master stimuli list, audio recordings, and visual displays.

---

## Contents

```
1_stimuli/
├── README.md                  ← You are here
├── stimuli_list.xlsx          Master spreadsheet (36 items × 6 conditions)
├── audio/                     216 MP3 files (complete sentences)
└── visual/                    Assembled 2×2 displays (1024×768)
```

---

## Stimuli List (`stimuli_list.xlsx`)

Master spreadsheet with one row per trial (216 rows for 36 items × 6 conditions). Columns:

| Column       | Description                                              |
|-------------|----------------------------------------------------------|
| `List`      | List assignment (1–6, Latin square)                      |
| `Item`      | Item number (1–36)                                       |
| `Structure` | Sentence structure: DO (double-object) or PO (prepositional-object) |
| `Stress`    | Prosodic condition: Neutral, Recipient, or Theme         |
| `Sentence`  | Chinese dative sentence ending with "而不是……". The stressed (contrastively focused) referent is highlighted in **red**. |
| `Target`    | Correct continuation after "而不是"                        |
| `Audio_File`| Filename of the corresponding audio stimulus             |
| `Visual_File` | Filename of the corresponding assembled visual display |
| `Loc_TL`    | Image filename at the **Top-Left** quadrant              |
| `Loc_TR`    | Image filename at the **Top-Right** quadrant             |
| `Loc_BL`    | Image filename at the **Bottom-Left** quadrant           |
| `Loc_BR`    | Image filename at the **Bottom-Right** quadrant          |

The `Loc_*` columns reference individual referent images by filename (see naming convention below; individual images available upon request). The `Visual_File` column references assembled displays in `visual/`.

---

## Audio Stimuli (`audio/`)

216 target audio files in MP3 format (converted from 44.1 kHz mono WAV recordings). Recorded by a male native Beijing Mandarin speaker with phonetics training (Shure SM58 + Focusrite Scarlett Solo, Adobe Audition).

The provided audio files are **complete sentences including the target word** (e.g., "他买给了医生鸡蛋，而不是水果"), used in the multimodal LLM experiment. Audio for the cloze-in-VWP task (human experiment) is identical but **truncated after "but not"** (*而不是*), with the target word removed (e.g., "他买给了医生鸡蛋，而不是……"). The truncated version is not included due to the supplementary materials size limit.

> **Note:** Filler audio files are not included in the supplementary materials.

### Naming convention

```
{Item}_{Structure}_{Stress}_{TargetType}.mp3
```

| Component     | Values                           | Description                                    |
|--------------|----------------------------------|------------------------------------------------|
| `Item`       | 1–36                             | Item number                                    |
| `Structure`  | `DO`, `PO`                       | Double-object vs. prepositional-object dative  |
| `Stress`     | `Neutral`, `Recipient`, `Theme`  | Prosodic stress condition                      |
| `TargetType` | `R`, `T`                         | Target entity type: **R**ecipient or **T**heme |

**TargetType logic:**
- **Recipient stress** → target is always a person → `_R`
- **Theme stress** → target is always an object → `_T`
- **Neutral stress** → target type varies by item (determined by cloze norming); `_R` if a person, `_T` if an object

**Example (Item 1: 他买给了医生鸡蛋，而不是……):**

| Filename               | Structure | Stress    | Target |
|------------------------|-----------|-----------|--------|
| `1_DO_Neutral_T.mp3`   | DO        | Neutral   | 水果 (fruit, Theme)     |
| `1_DO_Recipient_R.mp3` | DO        | Recipient | 护士 (nurse, Recipient) |
| `1_DO_Theme_T.mp3`     | DO        | Theme     | 水果 (fruit, Theme)     |
| `1_PO_Neutral_T.mp3`   | PO        | Neutral   | 水果 (fruit, Theme)     |
| `1_PO_Recipient_R.mp3` | PO        | Recipient | 护士 (nurse, Recipient) |
| `1_PO_Theme_T.mp3`     | PO        | Theme     | 水果 (fruit, Theme)     |

---

## Visual Stimuli (`visual/`)

### Assembled visual displays

Each trial presents 4 images simultaneously in a 2×2 grid. Positions are counterbalanced across Latin-square lists. These correspond to the `Visual_File` column in `stimuli_list.xlsx`.

The provided displays are those used in **Experiment 1 (human eye-tracking)**. Canvas: **1024 × 768 px**; each image: **300 × 300 px**.

Quadrant centers:
- Top-Left (256, 192) · Top-Right (768, 192)
- Bottom-Left (256, 576) · Bottom-Right (768, 576)

Follows standard VWP display conventions (cf. Corps, Liao & Pickering, 2023; Ito, Pickering & Corley, 2018).

For **Experiment 2 (multimodal LLM)**, displays were regenerated programmatically from the same individual images with adjusted dimensions optimized for Qwen2.5-Omni-7B: canvas **1008 × 756 px**, each image **308 × 308 px** (multiples of 28 pixels, matching the vision encoder's tokenization grid). Bounding boxes (pixel coordinates [left, top, right, bottom]): TL [84, 28, 392, 336], TR [616, 28, 924, 336], BL [84, 420, 392, 728], BR [616, 420, 924, 728].

### Individual referent images

Individual referent images (216 PNG files, 2400 × 2400 px, transparent background) sourced from a commercial clipart library (Clipart.com) are not included due to the supplementary materials size limit. Available upon request.

#### Naming convention (for reference)

```
T{Item}{EntityName}_{Role}_{EntityType}.png
```

| Component     | Values              | Description                                        |
|--------------|---------------------|----------------------------------------------------|
| `Item`       | 1–36                | Item number                                        |
| `EntityName` | e.g., `doctor`, `egg` | English name of the depicted entity              |
| `Role`       | `M`, `T`, `C`       | Experimental role (see below)                      |
| `EntityType` | `R`, `T`            | Entity type: **R**ecipient (person) or **T**heme (object) |

#### Role definitions

| Role | Full name   | Description                                                                       |
|------|-------------|-----------------------------------------------------------------------------------|
| `M`  | Mentioned   | Mentioned referent — the recipient or theme explicitly named in the sentence (always present on screen) |
| `T`  | Target      | Correct alternative after "but not" (*而不是*) — the entity participants should fixate          |
| `C`  | Competitor  | The other alternative entity that is *not* the target. If the alternative recipient is the target (`T_R`), then the alternative theme is coded as competitor (`C_T`); if the alternative theme is the target (`T_T`), then the alternative recipient is coded as competitor (`C_R`). |

#### Six images per item

Each item has **4 unique entities** represented by **6 image files**. The alternative recipient and alternative theme each have two versions (Target and Competitor), because their role switches depending on the stress condition:

| #  | Role | Type | Description                           | Example (Item 1)       |
|----|------|------|---------------------------------------|------------------------|
| 1  | M    | R    | Mentioned recipient                   | `T1doctor_M_R.png`     |
| 2  | M    | T    | Mentioned theme                       | `T1egg_M_T.png`        |
| 3  | T    | R    | Alternative recipient as target       | `T1nurse_T_R.png`      |
| 4  | C    | R    | Alternative recipient as competitor   | `T1nurse_C_R.png`      |
| 5  | T    | T    | Alternative theme as target           | `T1fruit_T_T.png`      |
| 6  | C    | T    | Alternative theme as competitor       | `T1fruit_C_T.png`      |

Images #3 and #4 depict the same entity (护士/nurse); #5 and #6 depict the same entity (水果/fruit). The T/C coding reflects the cross-category relationship:
- Under **Recipient stress**: the alternative recipient is the target (`T1nurse_T_R.png`), and the alternative theme is the competitor (`T1fruit_C_T.png`).
- Under **Theme stress**: the alternative theme is the target (`T1fruit_T_T.png`), and the alternative recipient is the competitor (`T1nurse_C_R.png`).
