# Simulation Code for Collaborative Online Survival Analysis

This repository contains simulation code for comparing online learning methods for Cox proportional hazards models.

## Methods Compared

- **Oracle**: Full-data Cox regression (benchmark)
- **COLSA**: Collaborative Operation of Linked Survival Analysis (proposed)
- **Meta**: Meta-analysis with inverse-variance weighting
- **Online**: Online Cox model with piecewise constant hazard
- **SGD**: Stochastic gradient descent for Cox model

## Project Structure

```
.
├── COLSA/              # COLSA R package (source)
├── online/
│   └── online_update.R # Online Cox model implementation
├── install.R           # Install R dependencies
├── generate.R          # Generate simulation data
├── run.R               # Run simulations for each method
├── analyze.R           # Analyze results and generate figures/tables
├── run.sh              # Shell script to run full pipeline
└── real-data.R         # Real data application
```

## Requirements

- R (>= 4.0)
- Required packages: `survival`, `MASS`, `simsurv`, `foreach`, `doSNOW`, `parallel`, `ggplot2`, `dplyr`, `tidyr`, `patchwork`, `xtable`, `adagio`, `coxphSGD`

## Quick Start

### Option 1: Run full pipeline

```bash
bash run.sh
```

### Option 2: Run step by step

```bash
# 1. Install dependencies
Rscript install.R

# 2. Generate simulation data
# Usage: Rscript generate.R <K> <n_small>
Rscript generate.R 200 300

# 3. Run simulation for each method
# Usage: Rscript run.R <K> <n_small> <method>
Rscript run.R 200 300 oracle
Rscript run.R 200 300 colsa
Rscript run.R 200 300 meta
Rscript run.R 200 300 online_bc
Rscript run.R 200 300 sgd

# 4. Analyze results
# Usage: Rscript analyze.R <n_small>
Rscript analyze.R 300
```

## Parameters

- `K`: Number of sites/batches (e.g., 6, 20, 50, 200)
- `n_small`: Sample size for small batches (batches 7 to K), default: 100
- `n_replication`: Number of simulation replications (default: 100)

## Output

- `data/sim_K=<K>_n=<n_small>_seed=<i>.csv`: Simulated datasets
- `results/sim_K=<K>_n=<n_small>_method=<method>.csv`: Estimation results
- `results/fig_combined.pdf`: Combined figure comparing methods
- `results/simulation_tables.tex`: LaTeX tables
