 DiL: Semi-supervised Continual Learning with DCD + CACE

This repository implements a Semi-supervised Continual Learning framework that integrates DCD and CACE on top of a standard SSCL training pipeline. The code supports CIFAR-10/100, CUB-200, and ImageNet-100, and provides DER and non-DER training paths.


 Method Overview
- CACE: builds composable block-sparse anchor vectors (as an alternative to ETF) to reserve discriminative feature subspaces for incremental classes.
- DCD: discrete contrastive distillation with class-feature selection and discretization; enforces stronger constraints on old classes and weaker constraints on new classes to mitigate forgetting and improve unlabeled-data usage.
- Unlabeled data and replay: combines pseudo-labeling, buffer replay, and consistency training to balance stability and plasticity.

 Project Structure
- `train_semi.py` / `train_semi_der.py`: main entry points for non-DER / DER training.
- `train_cub.py`: entry point for CUB-200 training.
- `utils_incremental/discrete_contrastive_distillation.py`: DCD core module.
- `utils_incremental/CACE.py`: CACE anchor construction.
- `utils_incremental/incremental_train_and_eval_semi.py`: incremental training and evaluation logic.
- `utils_pytorch.py`: shared utilities (including CACE integration).
- Run scripts: `run_cifar.sh`, `run_cub.sh`, `run_imagenet.sh`.
- Docs/notes: `DCD_README.md`, `DCD_IMPLEMENTATION_SUMMARY.md`, `DCD_CIFAR10_vs_CIFAR100_ANALYSIS.md`, etc.

 Requirements
- Python 3.10+
- PyTorch 2.1.x, torchvision 0.16.x
- See `environment.yml` for the full dependency list.
- Recommended (Conda): `conda env create -f environment.yml` and activate the environment.

 Data Preparation
- CIFAR-10/100: use the torchvision downloader or a local cache.
- CUB-200: follow the official directory layout (train/val).
- ImageNet-100: organize data by the split; make sure `--data_dir` points to the extracted root.
- Ensure `--image_size` (32 or 224) matches the dataset.

 Key Arguments (Common)
- Data/model: `--dataset {cifar10,cifar100,cub,imagenet100}`, `--model {resnet18,resnet20,resnet32}`, `--image_size`
- Incremental setup: `--nb_cl_fg` (classes in the first task), `--nb_cl` (new classes per step), `--num_classes`
- Training schedule: `--epochs` (first task), `--epochs_new` (incremental steps)
- Replay & unlabeled: `--buffer_size`, `--include_unlabel`, `--p_cutoff`, `--u_iter`, `--use_ulb_kd`, `--ulb_kd_mode similarity`
- DCD: `--enable_dcd`, `--lambda_dcd`, (optional) `--dcd_top_k_class`, `--dcd_top_k_sample`, `--dcd_alpha`, `--dcd_temperature`
- Other: `--warmup_epochs`, `--random_seed`, `--train_batch_size`, `--test_batch_size`, learning rate and scheduler settings

 Training and Evaluation Notes
- See `run_cifar.sh`, `run_cub.sh`, and `run_imagenet.sh` for example commands (including DCD/CACE settings).
- Use `train_semi_der.py` for DER and `train_semi.py` for non-DER.
- Evaluation / resume: add `--resume --model_path <checkpoint>`; set `--epochs` / `--epochs_new` to 0 for evaluation-only runs.
- Logs and checkpoints are written to `log/` and `checkpoint/` by default.

