## Bridging On-Device and Cloud LLMs for Collaborative Reasoning

[![python](https://img.shields.io/badge/Python_3.10-306998?logo=python&logoColor=FFD43B)](https://www.python.org/downloads/release/python-31012/)
[![License: MIT](https://img.shields.io/badge/license-MIT-750014.svg)](https://opensource.org/licenses/MIT) 

---
## 🔥 Our Framework

TL, DR: In this repo, we provide the implementation of our work **Bridging On-Device and Cloud LLMs for Collaborative Reasoning**.

<div align="center">
    <img src="figures/Overview.png" alt="overview" style="width:100%;"/>
</div>


## 🖥️ Prerequisites

Install the required packages via:
```bash
conda create -n collaborative_env python==3.10
conda activate collaborative_env
# Make sure you have activated verl conda env
# Run with FSDP
USE_MEGATRON=0 bash scripts/install_vllm_sglang_mcore.sh

# Install verl. This is different from the original verl repo from ByteDance Seed.
cd verl
pip install --no-deps -e .

```


## 🗂️ Folder Structure
```
Device_Cloud_Collaboration/
│─── README.md
├─── data
├─── eval
├─── examples
├─── scripts
├─── verl
```
- **`verl`**: Contains the primary codebase. Revised based on the verl repo from ByteDance Seed
- **`examples`**: Contains the runing scripts
- **`eval`**: For evaluation

## 🏃‍♂ Run Code

Run our framework with the following command:
```bash
./example/gapg_trainer/run_llama3_2_3b_math.sh
./example/gapg_trainer/run_qwen2_5_1_5b_math.sh
```
This code runs with 4 NVIDIA A100 GPUs in parallel