# LMLM  
**Pre-training Large Memory Language Models with Internal and External Knowledge**

This repository contains the code for training and evaluating Large Memory Language Models (LMLMs) that interleave internal and external knowledge via entity-level lookups. 

---
## 📦 Release

Pretrain Data, and model checkpoints will be released soon.

---

## 🛠 Installation

### 1. Install this package
```bash
pip install -e .
````

### 2. Install required dependencies

Clone and install the following external libraries:

* [TRL (Transformers Reinforcement Learning)](https://github.com/huggingface/trl.git)
* [LightEval (Lightweight Evaluation Framework)](https://github.com/huggingface/lighteval.git)

(Optional for machine unlearning benchmarks):

* [TOFU](https://github.com/locuslab/open-unlearning.git)

(Optional for Factscore benchmarks):

* [FactScore](https://github.com/shmsw25/FActScore.git)
---

## ⚙️ Setup

Set API credentials for tools like OpenAI and Weights & Biases:

```bash
./scripts_example/account
```

---

## 📚 Dataset Annotation

### Annotate seed dataset using GPT-4

```bash
bash ./scripts_example/annotate/annotate_gpt4.sh
```


### Annotate pretraining dataset using a fine-tuned model

```bash
bash ./scripts_example/annotate/annotate_llama-ft-lora.sh
```


---

## 🧪 Training

### Fine-tune annotation models

```bash
bash ./scripts_example/train/ft-llama8b.sh
```

### Pre-train LMLM

```bash
bash ./scripts_example/train/pretrain.sh
```

---

## 📊 Evaluation

### Evaluate perplexity on a given dataset

```bash
bash ./scripts_example/eval/eval_dataset.sh
```

### Evaluate general NLU tasks

```bash
bash ./scripts_example/eval/eval_pretrain_NLUtask.sh
```

### Evaluate factuality with FactScore

```bash
bash ./scripts_example/eval/eval_factscore.sh
```

### Evaluate knowledge completion with T-REx

```bash
bash ./scripts_example/eval/eval_loss_trex.sh
```

---

## 📁 Project Structure (Optional Overview)

```text
anonymous_code_LMLM/
├── README.md                   # Project documentation
├── requirements.txt            # Python dependencies
│
├── configs/                    # Config files for training, eval, tokenizer
│   ├── accelerate/             # Multi-GPU training configs
│   ├── eval/                   # Eval set config
│   ├── llama*/                # Model-specific configs
│   └── tiny-llama*/           # Tokenizer + architecture configs
│
├── prompts/                   # Prompt templates used in training/eval
│
├── experiment/eval/           # Evaluation scripts (FactScore, T-REx, etc.)
│
├── scripts_example/           # Example shell scripts for annotation, training, and evaluation
│   ├── annotate/
│   ├── train/
│   ├── eval/
│   └── trl/
│
├── src/memgpt/                # Source code
│   ├── annotate.py            # Dataset annotation logic
│   ├── annotators.py          # Annotator model definitions
│   ├── data_manager.py        # Dataset loading and chunking
│   ├── model.py               # Core LMLM model wrapper
│   ├── utils.py               # General utilities
│   ├── database/              # DB lookup utilities
│   └── trl/                   # TRL-based training modules
│       ├── pretrain.py        # Pretraining logic
│       ├── sft.py             # Supervised fine-tuning
│       └── utils/             # Model loading, metrics, masking, etc.

```




