# AuroraCap

## Annotation Steps
- Sample frames
- Step 1: structured caption generation
- Step 2: detailed caption generation

## Modifications in reproduction
- ⚠️Paper did not report the sample strategy. We sample at 1 fps, or fixed 32 frames if longer than 32s
- ⚠️Add restriction to the output format in the prompt of Step 2