## Project Structure

### Core Modules (`myrag/`)
The `myrag` folder contains the core implementation of the HGMem framework:

- **`base.py`**: Defines base classes, type definitions, and data structures including `QueryParam` for query configuration, `TextChunkSchema` for text chunks, and abstract storage interfaces
- **`myrag.py`**: Main RAG class (`MyRAG`) that orchestrates the entire hypergraph-based memory system, integrating all components for document insertion and querying
- **`llm.py`**: LLM interface functions supporting multiple backends (OpenAI API, vLLM, HuggingFace models, Ollama) for text generation and completion
- **`memory.py`**: Memory point management including creation, evolution, and postprocessing of hypergraph memory structures
- **`operate.py`**: Core operations including text chunking, entity extraction, node/chunk indexing, and query execution (naive, direct, and HGMem query modes)
- **`prompt.py`**: Prompt templates and formatting for various LLM interactions (entity extraction, memory evolution, query generation, etc.)
- **`storage.py`**: Storage backend implementations including `JsonKVStorage` (key-value), `NanoVectorDBStorage` (vector embeddings), `NetworkXStorage` (graph), and `HypergraphStorage`
- **`utils.py`**: Utility functions for text processing, hashing, caching, embedding operations, and context building
- **`rag_config.yaml`**: Configuration file for RAG system parameters

### Scripts (`scripts/`)
Entry point scripts for running HGMem:

- **`build_graph.py`**: Builds the hypergraph structure from a text corpus by extracting entities and creating memory points
- **`graph_run.py`**: Main execution script for running HGMem queries on datasets with various configuration options
- **`inspection.py`**: Utilities to inspect and analyze raised questions and their associated entities/chunks
- **`manipulate_graph.py`**: Tools for manipulating and analyzing existing hypergraph structures
- **`utils.py`**: Shared utility functions for scripts (storage loading, text processing, graph operations)
- **`run_rag.sh`**: Shell script for batch execution of RAG experiments

### Evaluation (`evaluate/`)
Scripts for evaluating HGMem performance:

- **`evaluate_by_scoring.py`**: Evaluation framework using scoring metrics to assess answer quality

### Data (`data/`)
Dataset storage and management:

- **`narrative_qa/`**: NarrativeQA dataset with document corpus and questions
- **`created_data/`**: Generated data including processed documents, extracted entities, and created questions

### Other Files
- **`images/`**: Image assets for README documentation
- **`requirements.txt`**: Python package dependencies

----

## Installation

```sh
conda create -n hgmem python=3.11
conda activate hgmem
pip install -r requirements.txt
```
Initialize the environmental variables and activate the environment:

```sh
export CUDA_VISIBLE_DEVICES=0,1,2,3
export HF_HOME=<path to Huggingface home directory>

conda activate HGMem
```

## Quick Start
### API Usage (GPT-4o)

This simple example will illustrate how to use `HGMem` with any vLLM-compatible locally deployed LLM.


1. Build Graph: 

```sh
export OPENAI_API_KEY="your_openai_api_key_here"
epxort OPENAI_BASE_URL="your_openai_base_url_here"

python build_graph.py --domains Long \
                      --source_texts narrative_qa \
                      --llm_model_name gpt-4o
                  
```

2. Run HGMem:
```sh
export OPENAI_API_KEY="your_openai_api_key_here"
epxort OPENAI_BASE_URL="your_openai_base_url_here"

python graph_run.py --domains Long \
                     --dataset narrative_qa \
                     --graph_mode single \
                     --rag_mode HGMem \
                     --source_type chunks100 \
                     --question_type traditional \
                     --llm_model_name gpt-4o
```

### Local Deployment (vLLM)

This simple example will illustrate how to use `HGMem` with any vLLM-compatible locally deployed LLM.

1. Run a local [vLLM server] with specified GPUs (leave enough memory for embedding model).

```sh
export CUDA_VISIBLE_DEVICES=0,1
export VLLM_WORKER_MULTIPROC_METHOD=spawn
export HF_HOME=<path to Huggingface home directory>

conda activate HGMem  # vllm should be in this environment

vllm serve Qwen2.5-32B --tensor-parallel-size 2 --max_model_len 4096 --gpu-memory-utilization 0.95 
```

2. Build Graph(adjust the llm_model_function in `build_graph.py` to use local vLLM): 

```sh
python build_graph.py --domains Long \
                      --source_texts narrative_qa \
                      --llm_model_name Qwen2.5-32b
```

3. Run HGMem(adjust the llm_model_function in `graph_run.py` to use local vLLM):
```sh
python graph_run.py --domains Long \
                     --dataset narrative_qa \
                     --graph_mode single \
                     --rag_mode HGMem \
                     --source_type chunks100 \
                     --question_type traditional \
                     --llm_model_name Qwen2.5-32b 
```


### Custom Datasets

To setup your own custom dataset for evaluation, follow the format and naming convention shown in `data/narrative_qa/domains/Long/0`. Organize your query corpus according to the query file `data/created_data/narrative_qa/domains/Long/0`, be sure to also follow our naming convention.

The corpus and optional query files should have the following format:

#### Retrieval Corpus JSON

```json
[
  [
    "he Project Gutenberg eBook, The Last Chronicle of Barset, by Anthony TrollopeThis eBook is for the use of anyone anywhere at no cost and with\nalmost no restrictions whatsoever.  You may copy it, rg.org"
  [
    "narrative",
    "test",
    "e0bc784c28ad8b8c"
  ]
  ],
  ...
]
```
#### Query JSONL

```json
{"question": "HOW DOES ADAM MORE ARRIVE IN THE \"LOST WORLD\"?", "reference_answer": [" THROUGH A VOLCANIC SUBTERRANEAN TUNNEL", "Shipwrecked"]}
{"question": "WHAT WAS ADAM MORE'S NATIONALITY?", "reference_answer": ["BRITISH", "British "]}
```
