## Introduction

This supplemental material consists of two files.

- `_log.txt`: documents the experimental results of our method (based on LLaVA-Next-Video-7B) on HC-STVGv1.  We output logs every 30 samples evaluated.  According to the log, the performance is as:

  - ```markdown
    gt_viou: 0.5278
    tiou: 0.3161
    viou: 0.2041
    viou@0.3: 0.3360
    gt_viou@0.3: 0.6720
    viou@0.5: 0.1243
    gt_viou@0.5: 0.6067
    ```
  - 
- `LLaVA_STVG_20250521_Last_Manuscript-2-16.pdf`: This file is appendix contents, which provides the experimental details needed for the main paper as well as visualization of the experimental results.