# Streaming Video Question Answering Evaluation Framework

This repository contains the implementation for evaluating multimodal models on streaming video question answering tasks on PhoStream for review purposes. All code and data will be publicly released after the necessary review process.

## Configuration

Edit `config/stream_config.yaml` to configure:

1. **Models**: Add your model configurations
   - For local models: specify model path
   - For API models: add API key and base URL

2. **Benchmarks**: Specify paths to your benchmark datasets

3. **Judger**: Configure the LLM judger for subjective question evaluation

## Usage

### Running Inference

Use the provided SLURM scripts:

```bash
# For inference
bash doubao.sh

# For evaluation
bash score.sh
```

## Key Features

- **Streaming Video Processing**: Processes videos in chunks to simulate real-time streaming
- **Multi-process Inference**: Supports parallel processing with multiple workers
- **Flexible Model Integration**: Easy to add new models via configuration
