# Multi-View Video Quality Assessment Scripts

This repository contains a collection of Python scripts for evaluating the quality of multi-view videos, comparing them against ground-truth originals. The metrics cover various aspects including feature similarity, temporal consistency, and cross-view consistency.

## Prerequisites

Before running these scripts, ensure you have the necessary Python packages installed. You can typically install them using pip:

```bash
pip install torch torchvision torchaudio
pip install opencv-python numpy tqdm
pip install ftfy regex
pip install git+https://github.com/openai/CLIP.git
pip install lpips
pip install timm # For DINOv2 models
# For FVD, pytorchvideo is recommended
pip install pytorchvideo
```

For certain scripts, additional setup is required:
- **E-Warp Calculator**: Requires cloning the RAFT repository. The script will attempt to do this automatically.
  ```bash
  git clone https://github.com/princeton-vl/RAFT.git raft_repo
  ```

## Data Structure

Most scripts assume the following directory structure:
- `gt/`: Contains the ground-truth videos (e.g., `head.mp4`, `right_hand.mp4`, `left_hand.mp4`).
- `path/to/generated/`: A directory containing the enhanced or generated videos with the same filenames.

---

## Available Metrics

This collection includes a variety of metrics to assess multi-view video quality from different perspectives.

### 1. Feature Signal-to-Noise Ratio (SNR)
- **File**: `video_snr_calculator.py`
- **Description**: Calculates the Signal-to-Noise Ratio (SNR) of video features for a group of videos. Signal is defined as the variance of mean features across different videos (inter-video diversity), while noise is the average variance of features within each video (intra-video consistency). A higher SNR indicates that the feature extractor is more sensitive to the content difference between videos than to the temporal variations within a single video.
- **Usage**:
  ```bash
  python video_snr_calculator.py --group_dirs path/to/your/videos --group_labels MyVideos --model_name inceptionv3
  ```

### 2. Temporal Consistency Metrics
Metrics in this category evaluate the smoothness and coherence of video frames over time.

#### E-Warp Temporal Consistency
- **File**: `video_ewarp_calculator.py`
- **Description**: Calculates E-Warp, a temporal consistency metric based on the L1 difference between a frame and the previous frame warped using RAFT optical flow.
- **Usage**:
  ```bash
  python video_ewarp_calculator.py --enh_dir path/to/generated/
  ```

#### Temporal LPIPS
- **File**: `video_lpips_calculator.py`
- **Description**: Computes temporal LPIPS, measuring perceptual similarity between adjacent frames in a video to evaluate temporal consistency.
- **Usage**:
  ```bash
  python video_lpips_calculator.py --enh_dir path/to/generated/
  ```

#### Temporal LPIPS for Picture Generation
- **File**: `video_lpips_calculator_for_pict.py`
- **Description**: Computes temporal LPIPS over progressively larger initial segments of videos (e.g., first 30 frames, first 60, etc.). Useful for analyzing how consistency evolves over time.
- **Usage**:
  ```bash
  python video_lpips_calculator_for_pict.py --enh_dirs path/to/generated1,path/to/generated2
  ```

#### Temporal Consistency Jitter (TCJ)
- **File**: `video_temporal_consistency_jitter_calculator.py`
- **Description**: Computes Temporal Consistency Jitter (TCJ) by measuring the variance of CLIP feature similarity between consecutive frames.
- **Usage**:
  ```bash
  python video_temporal_consistency_jitter_calculator.py --gt gt/ --gen path/to/generated/
  ```

### 3. Cross-View Consistency Metrics
These metrics assess the coherence and alignment of content across different camera views.

#### Cross-View Feature Consistency (CVFC)
- **File**: `video_cross_view_consistency_calculator.py`
- **Description**: Measures the similarity of CLIP features across multiple views (head, left_hand, right_hand) at corresponding timestamps.
- **Usage**:
  ```bash
  python video_cross_view_consistency_calculator.py --gt gt/ --gen path/to/generated/
  ```

#### Multi-View Depth Consistency (MVDC)
- **File**: `video_mvdc_calculator.py`
- **Description**: Computes Multi-View Depth Consistency (MVDC) by comparing MiDaS-estimated depth maps across different camera views.
- **Usage**:
  ```bash
  python video_mvdc_calculator.py --gt gt/ --gen path/to/generated/
  ```

### 4. Distribution and Feature Similarity Metrics
This group of metrics compares the generated videos against ground-truth videos in terms of feature distributions and content similarity.

#### CLIP Score Calculator
- **File**: `video_clipscore_calculator.py`
- **Description**: Computes various CLIP-based scores including inter-video similarity, intra-video self-similarity, and text-alignment between original and enhanced videos.
- **Usage**:
  ```bash
  python video_clipscore_calculator.py --orig_dir gt/ --enh_dir path/to/generated/ --text_prompts "a descriptive prompt for the video"
  ```

#### Feature Distribution Comparison
- **File**: `video_dist_compare_aggregate.py`
- **Description**: A comprehensive tool to compare feature distributions between groups of videos using various models (InceptionV3, CLIP, DINOv2) and metrics (FID, JSD, etc.).
- **Usage**:
  ```bash
  python video_dist_compare_aggregate.py --group_dirs gt/ --group_labels GroundTruth --group_dirs path/to/generated/ --group_labels Generated --model_name inceptionv3
  ```

#### Frechet Inception Distance (FID)
- **File**: `video_fid_calculator.py`
- **Description**: Computes the Frechet Inception Distance (FID) between frames sampled from original and enhanced videos.
- **Usage**:
  ```bash
  python video_fid_calculator.py --orig_dir gt/ --enh_dir path/to/generated/
  ```

#### Frechet Video Distance (FVD)
- **File**: `video_fvd_calculator.py`
- **Description**: Computes the Frechet Video Distance (FVD) between original and enhanced videos using a pre-trained I3D feature extractor.
- **Usage**:
  ```bash
  python video_fvd_calculator.py --orig_dir gt/ --enh_dir path/to/generated/
  ```