# Dynamic Stratified Contrastive Learning with Upstream Augmentation for MILP Branching

## Overview

**Mixed Integer Linear Programming (MILP)** is a fundamental class of NP-hard problems that has received great attention from both academia and industry. The **Branch-and-Bound (B&B)** algorithm remains the dominant approach for solving MILPs, where the *branching policy* plays a crucial role in determining efficiency.

Recent advances have applied **neural-based learning frameworks** to improve branching decisions, but existing methods still suffer from:
- **Semantic variation across tree depths**
- **Scarcity of upstream nodes**
- **High cost of collecting strong branching samples**

To overcome these limitations, we propose **SC-MILP**, a *Dynamic Stratified Contrastive Training Framework* for MILP branching.

---

## 🔍 Key Idea

SC-MILP groups B&B nodes based on their **feature distributions** and trains a **GCNN-based discriminative model** to progressively separate nodes across groups.  
This enables the model to learn **finer-grained and depth-aware node representations**.

Additionally, to address **data scarcity and imbalance** in upstream nodes, we introduce an **upstream-augmented MILP derivation procedure** that generates both:
- **Theoretically equivalent** MILP instances, and  
- **Perturbed instances** for better generalization.

SC-MILP effectively models subtle semantic differences between nodes, leading to:
- Higher **branching accuracy**
- Improved **solving efficiency**
- Better **generalization** to unseen instances

---

## 🚀 Installation

### 1. Install SCIP Solver
First, install the **SCIP solver (v7.0.1)**.  
Please follow the installation guide provided by [learn2branch](https://github.com/ds4dm/learn2branch/blob/master/INSTALL.md).

### 2. Install PySCIPOpt
Then, install **PySCIPOpt**.  
We use the version available at:  
👉 [https://github.com/Wenbo11/PySCIPOpt/tree/scip700](https://github.com/Wenbo11/PySCIPOpt/tree/scip700)

### 3. Install Python Dependencies
```bash
pip install -r requirements.txt
```
---

## 📊 Instance Generator

You can generate MILP problem instances using the provided generator script.

### 1. Basic Usage
```bash
python instance_generator.py setcover
```
### 2. Supported Problem Types
Replace `setcover` with one of the following:
- `setcover` → Set Covering Problem
- `auctions` → Combinatorial Auctions
- `facilities` → Facility Location Problem
- `indset` → Independent Set Problem

Each instance type has configurable parameters (number of rows, columns, density, etc.) for scalable dataset creation.

---

## 🏋️ Training

Train the SC-MILP model using distributed learning (example: 2 GPUs):
```bash
python -m torch.distributed.launch --nproc_per_node=2 behavior_cloning_cons.py \
--ins_type setcover_400r_1000c_0.05d_100mc_0se \
--code_id new_20 \
--weight_type 1 \
--lr 1e-3 \
--shrink_lr 0.3 \
--loss ce \
--batch_size 16 \
--debug no \
--alpha 0.1 \
--epoch_num 400
```
### Key Arguments

| Argument      | Description |
|--------------|-------------|
| `--ins_type` | Instance type (e.g., `setcover_400r_1000c_0.05d_100mc_0se`) |
| `--code_id`  | Model version or identifier |
| `--weight_type` | Weight initialization type |
| `--lr`       | Learning rate |
| `--shrink_lr` | Learning rate decay factor |
| `--loss`     | Loss function (e.g., `ce` = cross-entropy) |
| `--alpha`    | Auxiliary loss coefficient |
| `--epoch_num` | Number of training epochs |
| `--batch_size` | Training batch size |

---

## 📈 Evaluation

After training, evaluate the model performance using:
```bash
bash eval.sh setcover new_20_topk 111 140 140 140
```
- The first argument (`setcover`) specifies the problem type
- The second argument (`new_20_topk`) corresponds to the model code ID used in training
- The remaining numbers indicate the evaluation range or checkpoints

---

## 🧩 Supported Problem Types

| Problem | Keyword | Description |
|---------|---------|-------------|
| Set Covering | `setcover` | Classic covering optimization problem |
| Combinatorial Auctions | `auctions` | Winner determination in combinatorial auctions |
| Facility Location | `facilities` | Location and allocation optimization |
| Independent Set | `indset` | Graph-based combinatorial optimization |

---

## 📝 Notes

- Ensure SCIP and PySCIPOpt versions are compatible (SCIP 7.0.1 + PySCIPOpt scip700 branch)
- The training script uses PyTorch Distributed for multi-GPU learning
- Hyperparameters such as `lr`, `alpha`, and `epoch_num` can be adjusted for different scales or datasets
- Generated instances and model checkpoints are stored under corresponding directories (`data/`, `logs/`, `models/`)

7️⃣ Citation

If you use SC-MILP in your research, please consider citing this project (citation information coming soon).
