# NCV (CLIP-Based)

## Introduction
This repository provides a framework for evaluating SpLiCE embeddings using linear and non-linear classifiers, and also trains a self-explanatory multi-agent classifier, based on the Merlin-Arthur classifier on these embeddings. This classifier deploys multiple agents working both collaboratively and adversarally to enhance interpretability and robustness in classification tasks. SpLiCE (Sparse Linear Concept Extraction) is utilized to extract meaningful visual concepts from images, and the multi-agent system chooses subsets of these concepts, ensuring that both the extraction and classification processes are transparent and efficient. 

<!-- <div align="center">
[Consider adding a conceptual diagram here]
</div> -->

## Table of Contents
- [Introduction](#introduction)
- [Getting Started](#getting-started)
  - [Installation](#installation)
- [Project Structure](#project-structure)
- [Usage](#usage)
  - [Precomputing Embeddings](#precomputing-embeddings)
  - [Pretraining a Classifier](#pretraining)
  - [Stochastic Frank-Wolfe (SFW) Training](#stochastic-frank-wolfe-training)
  - [Debug Mode](#debug-mode)
- [Available Models](#available-models)
- [Configuration](#configuration)

## Getting Started


2. Create and activate the conda environment:
```bash
conda env create --file env.yml -n concept-learning
conda activate concept-learning
```
Execute the following command to install SpLiCE into your environment:

```bash
pip install git+https://github.com/AI4LIFE-GROUP/SpLiCE.git
```
(I have installed SpLiCE manually by cloning their repository into a separate folder. So if the above command does not work, try their [installation guide](https://github.com/AI4LIFE-GROUP/SpLiCE?tab=readme-ov-file#installation))

## Project Structure
```
MerlinArthur-SpLiCE/
├── src/
│   ├── data/
│   │   └── precompute_embeddings.py
│   ├── models/
│   │   └── classifier.py
│   ├── trainer/
│   │   └── trainer_framework.py
│   ├── merlin_arthur_framework/
│   │   ├── feature_selectors.py
│   │   └── stochastic_frank_wolfe.py
│   ├── config/
│   │   ├── config.py
│   │   ├── config_dataclass.py
│   │   └── argparser.py
│   └── main.py
```

## Usage

### Precomputing Embeddings
Before training, you need to precompute SpLiCE embeddings:

```bash
python src/data/precompute_embeddings.py \
    --dataset_root ../../SCRATCH/datasets \
    --save_dir ../../SCRATCH/datasets/embeddings_cifar100 \
    --l1_penalties 0.1
```
The framework currently supports the following datasets: `CIFAR-100`, `ImageNet`. Make sure that the dataset directory and any required preprocessing steps are correctly specified when running the respective commands for precomputing embeddings and training.

### Pretraining
Train a linear classifier on the embeddings:

```bash
python src/main.py \
    --epochs 10 \
    --approach regular \
    --dataset cifar100 \
    --batch_size 32 \
    --model linear \
    --lr 0.001 \
    --l1_penalty_splice 0.100
```
In all training settings, you can choose to include `--wandb` as an argument to start logging with _Weights & Biases_. Please make sure that you are logged into your wandb account locally beforehand.

### Stochastic Frank-Wolfe Training
```bash
python src/main.py \
    --epochs 10 \
    --approach sfw \
    --dataset cifar100 \
    --batch_size 32 \
    --model linear \
    --lr 0.001 \
    --l1_penalty 0.100 \
    --mask_size 100 \
    --lr_merlin 0.01 \
    --l1_penalty_coefficient 0.01 \
    --pretrained_model
```
This approach first loads a pretrained model (trained using the regular approach), then applies SFW feature selection to identify the most important features.


### Debug Mode
For quick testing, use debug mode:

```bash
python src/main.py \
    --debug \
    --epochs 10 \
    --approach regular \
    --dataset cifar100 \
    --batch_size 32 \
    --model linear
```

### Available Models
The framework includes several classifier architectures:
- `LinearClassifier`: Simple linear layer for baseline evaluation
- `LowRankLinearClassifier`: Simple linear layer with less parameter and configureable rank (default rank=32)
- `MLPClassifier`: Multi-layer perceptron with configurable hidden dimensions
- `NonLinearClassifier`: Non-linear classifier with layer normalization and GELU

### Configuration
Configuration is handled through command-line arguments and dataclasses:
- Training configuration (epochs, approach, debug mode, etc.)
- Dataset configuration (root directory, batch size, workers, etc.)
- Model configuration (architecture, learning rate, weight decay, etc.)
- Boolean flags (save model, use AMP, CUDA benchmark, etc.)
- Feature Selector configuration (mask size, SFW step size, etc.)
